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TRANSCRIPTIONAL ACTIVATOR NUCLEIC ACIDS, POLYPEPTIDES 
AND METHODS OF USE THEREOF 

TECHNICAL FIELD 

The present invention relates generally to plant molecular biology. More 
specifically, it relates to nucleic acids and methods for modulating their 
expression in plants. 

BACKGROUND OF THE INVENTION 

Major advances in plant transformation have occurred over the last few 
years. However, in major crop plants, such as maize and soybeans, serious 
genotype limitations still exist. Transformation of agronomically important maize 
inbred lines continues to be both difficult and time consuming. Traditionally, the 
only way to elicit a culture response has been by optimizing medium components 
and/or explant material and source. This has led to success in some genotypes, 
but most elite hybrids fail to produce a favorable culture response. While, 
transformation of model genotypes is efficient, the process of introgressing 
transgenes into production inbreds is laborious, expensive and time consuming. 
It would save considerable time and money if genes could be introduced into and 
evaluated directly in commercial hybrids. 

Current methods for genetic engineering in maize require a specific cell 
type as the recipient of new DNA. These cells are found in relatively 
undifferentiated, rapidly growing callus cells or on the scutellar surface of the 
immature embryo (which gives rise to callus). Irrespective of the delivery method 
currently used, DNA is introduced into literally thousands of cells, yet 
transformants are recovered at frequencies of 10* relative to transiently- 
expressing cells. Exacerbating this problem, the trauma that accompanies DNA 
introduction directs recipient cells into cell cycle arrest and accumulating 
evidence suggests that many of these cells are directed into apoptosis or 
programmed cell death. (Reference Bowen et al, Third International Congress of 
the International Society for Plant Molecular Biology, 1991, Abstract 1093). 
Therefore it would be desirable to provide improved methods capable of 
increasing transformation efficiency in a number of cell types. 
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Typically a selectable marker is used to recover transformed cells. 
Traditional selection schemes expose all cells to a phytotoxic agent and rely on 
the introduction of a resistance gene to recover transformants. Unfortunately, the 
presence of dying cells may reduce the efficiency of stable transformation. It 
5 would therefore be useful to provide a positive selection system for recovering 
transformants. 

In spite of increases in yield and harvested area worldwide, it is predicted 
that over the next ten years, meeting the demand for corn will require an 
additional 20% increase over current production (Dowswell, C.R., Paliwal, R.L., 

1 0 Cantrell, R.P., 1 996, Maize in the Third World, Westview Press, Boulder, CO). 

In hybrid crops, including grains, oil seeds, forages, fruits and vegetables, 
there are problems associated with the development and production of hybrid 
seeds. The process of cross-pollination of plants is laborious and expensive. In 
the cross-pollination process, the female plant must be prevented from being 

1 5 fertilized by its own pollen. Many methods have been developed over the years, 
such as detasseling in the case of corn, developing and maintaining male sterile 
lines, and developing plants that are incompatible with their own pollen, to name 
a few. Since hybrids do not breed true, the process must be repeated for the 
production of every hybrid seed lot. 

20 To further complicate the process, inbred lines are crossed. For example 

in the case of corn, the inbreds can be low yielding. This provides a major 
challenge in the production of hybrid seed corn. In fact, certain hybrids cannot be 
commercialized at all due to the performance of the inbred lines. The production 
of hybrid seeds is consequently expensive, time consuming and provides known 

25 and unknown risks. It would therefore be valuable to develop new methods which 
contribute to the increase of production efficiency of hybrid seed. 

As new traits are added to commercial crops by means of genetic 
engineering, problems arise in "stacking" traits. In order to develop heritable 
stacked traits, the traits must be linked because of segregating populations. 

30 Improved methods for developing hybrid seed which would not require linking of 
the traits would significantly shorten the time for developing commercial hybrid 
seeds. 
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Gene silencing is another problem in developing heritable traits with 
genetic engineering. Frequently gene silencing is seen following meiotic 
divisions. Elimination or reduction of this problem would advance the state of 
science and industry in this area. 

5 

SUMMARY OF THE INVENTION 

It is the object of the present invention to provide nucleic acids and 
polypeptides relating to embryogenesis. 

It is another object of the present invention to provide nucleic acids and 
10 polypeptides that can be used to identify interacting proteins involved in 
transcription regulation in embryogenesis. 

It is another object of the present invention to provide antigenic fragments 
of the polypeptides of the present invention. 

It is another object of the present invention to provide transgenic plants 
1 5 and plant parts containing the nucleic acids of the present invention. 

It is another object of the present invention to provide methods for 
modulating, in a transgenic plant, the expression of the nucleic acids of the 
present invention. 

It is another object of the present invention to provide a method for 
20 improving transformation frequencies. 

It is another object of the present invention to provide a method for 
improving transformation efficiency in cells from various sources. 

It is another object of the present invention to provide a method for a 
positive selection system. 
25 It is another object of the present invention to provide a method for 

efficiently producing hybrid seed via apomixis. 

It is another object of the present invention to provide a method for 
stacking traits which does not require linking of traits. 

It is another object of the present invention to provide a method for 
30 reducing the problem of gene silencing. 

The present invention relates to a HAP3-type CCAAT-box binding 
transcriptional activator polynucleotides and polypeptides, and in particular, the 
leafy cotyledon 1 transcriptional activator (LEC1) polynucleotides and 
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polypeptides. In other aspects the present invention relates to expression 
cassettes optionally linked in antisense orientation, host cells transfected with at 
least one expression cassette, and transgenic plants and seeds comprising the 
expression cassettes. Further aspects of the invention include methods of using 
5 the polynucleotides and polypeptides. In a further aspect, the present invention 
relates to a method of modulating expression of the polynucleotides encoding the 
polypeptides of the present invention in a plant. Expression of the 
polynucleotides encoding the proteins of the present invention can be increased 
or decreased relative to a non-transformed control plant. 

10 

BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 depicts the comparison of various sequences and the alignment of 
the conserved regions. 

15 DETAILED DESCRIPTION OF THE INVENTION 

DEFINITIONS 

The term "isolated" refers to material, such as a nucleic acid or a protein, 

which is: (1) substantially or essentially free from components which normally 

accompany or interact with the material as found in its naturally occurring 
20 environment or (2) if the material is in its natural environment, the material has 

been altered by deliberate human intervention to a composition and/or placed at a 

locus in the cell other than the locus native to the material. 

As used herein, "nucleic acid" means a polynucleotide and includes single 

or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases. 
25 Nucleic acids may also include modified nucleotides that permit correct read 

through by a polymerase and do not alter the expression of a polypeptide 

encoded by the polynucleotide. 

As used herein, "LEC1 nucleic acid" means a nucleic acid or 

polynucleotide that codes for a LEC1 polypeptide. 
30 As used herein, "polypeptide" means proteins, protein fragments, modified 

proteins, amino acid sequences and synthetic amino acid sequences. The 

polypeptide can be glycosylated or not. 
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As used herein, "LEC1 polypeptide" means a HAP3 family member, 
CCAAT-box binding transcriptional activator polypeptide that regulates gene 
expression during embryo development, and that contains the conserved 
sequence set out in SEQ ID NO: 23. 

As used herein, "plant" includes plants and plant parts including but not 
limited to plant cells, plant tissue such as leaves, stems, roots, flowers, and 
seeds. 

As used herein, "promoter" includes reference to a region of DNA upstream 
from the start of transcription and involved in recognition and binding of RNA 
polymerase and other proteins to initiate transcription. 

By "fragment" is intended a portion of the nucleotide sequence or a portion 
of the amino acid sequence and hence protein encoded thereby. Fragments of a 
nucleotide sequence may encode protein fragments that retain the biological 
activity of the native nucleic acid. Alternatively, fragments of a nucleotide 
sequence that are useful as hybridization probes generally do not encode 
fragment proteins retaining biological activity. Thus, fragments of a nucleotide 
sequence are generally greater than 20, 30, 50, 100, 150, 200 or 300 nucleotides 
and up to the entire nucleotide sequence encoding the proteins of the invention. 
Generally the probes are less than 1000 nucleotides and preferably less than 500 
nucleotides. Fragments of the invention include antisense sequences used to 
decrease expression of the inventive polynucleotides. Such antisense fragments 
may vary in length ranging from at least about 20 nucleotides, about 50 
nucleotides, about 100 nucleotides, up to and including the entire coding 
sequence. 

By "functional equivalent" as applied to a polynucleotide or a protein is 
intended a polynucleotide or a protein of sufficient length to modulate the level of 
LEC1 protein activity in a plant cell. A polynucleotide functional equivalent can 
be in sense or antisense orientation. 

By "variants" is intended substantially similar sequences. Generally, 
nucleic acid sequence variants of the invention will have at least 60%, 65%, or 
70%, preferably 75%, 80% or 90%, more preferably at least 95% and most 
preferably at least 98% sequence identity to the native nucleotide sequence, 
wherein the % sequence identity is based on the entire sequence and is 
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determined by GAP analysis using Gap Weight of 50 and Length Weight of 3. 
Generally, polypeptide sequence variants of the invention will have at least about 
50%. 55%, 60%, 65%, 70%, 75% or 80%, preferably at least about 85% or 90%, 
and more preferably at least about 95% sequence identity to the native protein, 
wherein the % sequence identity is based on the entire sequence and is 
determined by GAP analysis using Gap Weight of 12 and Length Weight of 4. 

A "responsive plant cell" or "responsive host cell" refers to a cell that 
exhibits a positive response to the introduction of LEC1 polypeptide or LEC1 
polynucleotide compared to a cell that has not been introduced with LEC1 
polypeptide or LEC1 polynucleotide. The response can be to enhance tissue 
culture response, induce somatic embryogenesis, induce apomixis, increase 
transformation efficiency or increase recovery of regenerated plants. 

A "recalcitrant plant cell" is a responsive plant cell that generally does not 
exhibit a positive response such as tissue culture response, transformation 
efficiency or recovery of regenerated plants. 

NUCLEIC ACIDS 

The present invention relates to a HAP3-type CCAAT-box binding 
transcriptional activators, and in particular, the leafy cotyledon 1 transcriptional 
activator (LEC1 ). Expression of the LEC1 polynucleotide initiates formation of 
embryo-like structures and improves growth and recovery of transformants. The 
term apomixis is used to describe asexual reproduction that replaces or 
substitutes sexual methods of reproduction. When apomixis occurs, embryos are 
produced from maternal tissue and use only the maternal genome. 

In particular the present invention relates to an isolated nucleic acid 
comprising a member selected from the group consisting of: 

(a) a polynucleotide which encodes a polypeptide of SEQ ID NO: 2, 8, 
10, 12, 14, 16, 18,20, or22; 

(b) a polynucleotide amplified from a plant nucleic acid library using the 
primers of SEQ ID NOS: 3 and 4, 5 and 6, 9 and 10, or 1 1 and 12 or 
primers determined by using Vector nti Suite, InforMax Version 5. 

(c) a polynucleotide comprising at least 20 contiguous bases of SEQ ID 
NO: 1,7, 9,11,13, 15,17, 19, or 21; 
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(d) a polynucleotide encoding a plant HAP3-type ccaat-box transcriptional 
activator with the conserved motif of SEQ ID NO: 23, wherein the 
polynucleotide is from a plant other than Arabidopsis] 

(e) a polynucleotide having at least 60% sequence identity to SEQ ID NO: 
5 1, 9, 11, 13, 17, or 21 or 65% sequence identity to SEQ ID NO: 15 or 

19 or 70% sequence identity to SEQ ID NO: 7, wherein the % 
sequence identity is based on the entire sequence and is determined 
by GAP analysis using Gap Weight of 50 and Length Weight of 3; 

(f) a polynucleotide comprising at least 25 nucleotides in length which 
10 hybridizes under high stringency conditions to a polynucleotide 

having the sequence set forth in SEQ ID NO: 1 , 7, 9, 1 1 , 1 3, 1 5, 1 7, 
19, or 21; 

(g) a polynucleotide encoding the protein of SEQ ID NO: 2, 8, 10, 12, 
14, 16, 18, 20, or 22, wherein the polynucleotide is from a plant 

1 5 other than Arabidopsis] 

(h) a polynucleotide having the sequence set forth in SEQ ID NO: 1 , 7, 9, 
11, 13, 15, 17, 19, or 21; and 

(i) a polynucleotide complementary to a polynucleotide of (a) through 
(h). 

20 In many cases of apomixis maternal tissues such as the nucellus or inner 

integument "bud off" producing somatic embryos. These embryos then develop 
normally into seed. Since meiosis and fertilization are circumvented, the plants 
developing from such seed are genetically identical to the maternal plant. 
Expression of the leafy cotyledon 1 gene in the nucellus integument, or cell 

25 specific expression in the megaspore mother cell would trigger embryo formation 
from maternal tissues. 

Producing a seed identical to the parent has many advantages. For 
example high yielding hybrids could be used in seed production to multiply 
identical copies of high yielding hybrid seed. This would greatly reduce seed cost 

30 as well as increase the number of genotypes which are commercially available. 
Genes can be evaluated directly in commercial hybrids since the progeny would 
not segregate. This would save years of back crossing. 
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Apomixis would also provide a method of containment of transgenes when 
coupled with male sterility. The construction of male sterile autonomous 
agamospermy would prevent genetically engineered traits from hybridizing with 
weedy relatives. 

Gene stacking would be relatively easy with apomixis. Hybrids could be 
successively re-transformed with various new traits and propagated via apomixis. 
The traits would not need to be linked since apomixis avoids the problems 
associated with segregation. 

Apomixis can provide a reduction in gene silencing. Gene silencing is 
frequently seen following meiotic divisions. Since meiotic divisions never occur, it 
may be possible to eliminate or reduce the frequency of gene silencing. Apomixis 
can also be used stabilize desirable phenotypes with complex traits such as 
hybrid vigor. Such traits could easily be maintained and multiplied indefinitely via 
apomixis. 

The Cauliflower Mosaic Virus 35S promoter has been used to overexpress 
LEC1 during Agrobacfem/m-mediated in planta transformation of Arabidopsis 
(Harada ef a/., WO 98/37184). As pointed out by Harada et a/., 35S is a strong 
promoter, and in their experiments it was found that 35S:LEC1 did not improve 
transformation and actually appeared to hinder it (transformation efficiency with 
35S:LEC1 was estimated to be 0.6% of that obtained normally). Thus, 
overexpression in a cell type such as those in the gametophytic stage of 
development may be inappropriate and detrimental to the transformation process 
and successful recovery of transformed progeny. In contrast, we have shown that 
ectopic expression of the LEC1 gene under the appropriate control elements 
(including tissue specific and/or inducible promoters) and in the appropriate plant 
cells can be used to stimulate embryo formation in tissues/genotypes normally not 
amenable to culture. Likewise ectopic expression in genotypes amenable to 
culture can increase the number of embryo precursor cells (or increase the 
number that develop into embryos) leading to an increase in transformation 
frequency. Transient expression using RNA or protein may be sufficient to initiate 
the cascade of events leading to embryo formation. This would be valuable in 
such target tissues as maize scutella, immature leaf bases, immature tassels, etc. 
The LEC1 gene could be used as a positive selectable marker, i.e. triggering 
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embryogenesis in transgenic cells without killing the surrounding wild-type cells. 
This would happen since the cells receiving the introduced gene would undergo 
embryogenesis or in tissues already undergoing embryogenesis LEC1 expression 
would stimulate more rapid reiteration of somatic embryos. 

It has been shown through sequence similarity that the Arabidopsis LEC1 
polypeptide is homologous to the HAP3 subunit of the "CCAAT-box binding 
factor" class of eukaryotic transcriptional activators (Lotan et ai, 1998, Cell 
93:1 195-1205). This class of proteins, which consist of Hap2/3/4 and 5, form a 
heteroligomeric transcriptional complex, that appears to activate specific gene 
sets in eukaryotes. Certain members of this family such as Hap2 and Hap5 
appear to be ubiquitously expressed, while different Hap3 members are under 
developmental or environmental regulation. Plant HAP3 polypeptides can be 
recognized by a high degree of sequence identity to other HAP3 homologs in the 
"B domain" of the protein. For example, the B domain for the Arabidopsis LEC1 , 
from amino acid residue 28 to residue 117, shares between 55% and 63% identity 
(75-85% similarity) to other members of the HAP3 family, including maize (HAP3), 
chicken, lamprey, Xenopus, human, mouse, Emericella nidulens, 
Schizosaccharomyces pombe, Saccharomyces cerevisiae and Kluuyveromyces 
lactis (Lotan et ai, 1998). 

Expression of the LEC1 gene in transformed cells initiates embryo 
development and stimulates development of pre-existing embryos. Normally, 
LEC1 expression is necessary for proper embryo maturation in the latter stages of 
embryo development, and LEC1 transgene expression thus may also promote 
these processes. The combined effect of these impacts on somatic 
embryogenesis is not only to stimulate growth of transformed cells, but also to 
insure that transformed somatic embryos develop in a normal, viable fashion 
(increasing the capacity of transformed somatic embryos to germinate vigorously). 
Continued ectopic overexpression beyond embryo maturation may negatively 
impact germination and vegetative plant growth (which may necessitate down- 
regulation of the LEC1 transgene during these stages of development. 

Expression of the LEC1 gene will stimulate growth in cells with the 
potential to initiate or maintain embryogenic growth. Cells in established 
meristems or meristem-derive cell lineages may be less prone to undergo the 
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transition to embryos. In addition, transformation methods that target certain 
reproductive tissues (or cells) such as vacuum-infiltration of Agrobacterium into 
Arabidopsis may have detrimental effects on recovery of transformants (triggering 
genes associated with embryogenesis may disrupt the proper functioning of these 
5 cells). 

The polypeptides encoded by the present plant LEC1 genes can be 
distinguished from non-LEC HAP3 proteins by using the diagnostic motif shown in 
SEQ ID NO: 23. 

The isolated nucleic acids of the present invention can be made using (a) 

1 0 standard recombinant methods, (b) synthetic techniques, or combinations thereof. 
In some embodiments, the polynucleotides of the present invention will be cloned, 
amplified, or otherwise constructed from a monocot or dicot. In preferred 
embodiments the monocot is corn, sorghum, barley, wheat, millet, or rice. 
Preferred dicots include soybeans, sunflower, canola, alfalfa, potato, or cassava. 

15 Functional fragments included in the invention can be obtained using 

primers which selectively hybridize under stringent conditions. Primers are 
generally at least 12 bases in length and can be as high as 200 bases, but will 
generally be from 15 to 75, preferably from 15 to 50. Functional fragments can be 
identified using a variety of techniques such as restriction analysis, Southern 

20 analysis, primer extension analysis, and DNA sequence analysis. 

The present invention includes a plurality of polynucleotides that encode 
for the identical amino acid sequence. The degeneracy of the genetic code 
allows for such "silent variations" which can be used, for example, to selectively 
hybridize and detect allelic variants of polynucleotides of the present invention. 

25 Additionally, the present invention includes isolated nucleic acids comprising 

allelic variants. The term "allele" as used herein refers to a related nucleic acid of 
the same gene. 

Variants of nucleic acids included in the invention can be obtained, for 
example, by oligonucleotide-directed mutagenesis, linker-scanning mutagenesis, 
30 mutagenesis using the polymerase chain reaction, and the like. See, for example, 
Ausubel, pages 8.0.3 - 8.5.9. Also, see generally, McPherson (ed.), DIRECTED 
MUTAGENESIS: A Practical Approach, (IRL Press, 1991). Thus, the present 
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invention also encompasses DNA molecules comprising nucleotide sequences 
that have substantial sequence similarity with the inventive sequences. 

Variants included in the invention may contain individual substitutions, 
deletions or additions to the nucleic acid or polypeptide sequences which alters, 
5 adds or deletes a single amino acid or a small percentage of amino acids in the 
encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
When the nucleic acid is prepared or altered synthetically, advantage can be 
taken of known codon preferences of the intended host. 

10 The present invention also includes "shufflents" produced by sequence 

shuffling of the inventive polynucleotides to obtain a desired characteristic. 
Sequence shuffling is described in PCT publication No. 96/19256. See also, 
Zhang, J. H., etai., Proc. Natl. Acad Sci. USA 94:4504-4509 (1997). 

The present invention also includes the use of 5' and/or 3' UTR regions for 

1 5 modulation of translation of heterologous coding sequences. Positive sequence 
motifs include translational initiation consensus sequences (Kozak, Nucleic Acids 
Res. 15:81 25 (1987)) and the 7-methylguanosine cap structure (Drummond et al, 
Nucleic Acids Res. 13:7375 (1985)). Negative elements include stable 
intramolecular 5' UTR stem-loop structures (Muesing et al., Cell 48:691 (1987)) 

20 and AUG sequences or short open reading frames preceded by an appropriate 
AUG in the 5' UTR (Kozak, supra, Rao et ai, Mol. and Cell. Biol. 8:284 (1988)). 

Further, the polypeptide-encoding segments of the polynucleotides of the 
present invention can be modified to alter codon usage. Altered codon usage can 
be employed to alter translational efficiency. Codon usage in the coding regions 

25 of the polynucleotides of the present invention can be analyzed statistically using 
commercially available software packages such as "Codon Preference" available 
from the University of Wisconsin Genetics Computer Group (see Devereaux et ai, 
Nucleic Acids Res. 12:387-395 (1 984)) or MacVector 4.1 (Eastman Kodak Co., 
New Haven, Conn.). 

30 For example, the inventive nucleic acids can be optimized for enhanced 

expression in plants of interest. See, for example, EPA0359472; W091/16432; 
Perlak et al. (1991) Proc. Natl. Acad. Sci. USA 88:3324-3328; and Murray etai 
(1989) Nucleic Acids Res. 17:477-498. In this manner, the polynucleotides can 
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be synthesized utilizing plant-preferred codons. See, for example, Murray et al. 
(1989) Nucleic Acids Res. 1 7:477-498, the disclosure of which is incorporated 
herein by reference. 

The present invention provides subsequences comprising isolated nucleic 
acids containing at least 16 contiguous bases of the inventive sequences. For 
example the isolated nucleic acid includes those comprising at least 16, 20, 25, 
30, 40, 50, 60, 75 or 100 contiguous nucleotides of the inventive sequences. 
Subsequences of the isolated nucleic acid can be used to modulate or detect 
gene expression by introducing into the subsequences compounds which bind, 
intercalate, cleave and/or crosslink to nucleic acids. 

The nucleic acids of the invention may conveniently comprise a multi- 
cloning site comprising one or more endonuclease restriction sites inserted into 
the nucleic acid to aid in isolation of the polynucleotide. Also, translatable 
sequences may be inserted to aid in the isolation of the translated polynucleotide 
of the present invention. For example, a hexa-histidine marker sequence 
provides a convenient means to purify the proteins of the present invention. 

A polynucleotide of the present invention can be attached to a vector, 
adapter, promoter, transit peptide or linker for cloning and/or expression of a 
polynucleotide of the present invention. Additional sequences may be added to 
such cloning and/or expression sequences to optimize their function in cloning 
and/or expression, to aid in isolation of the polynucleotide, or to improve the 
introduction of the polynucleotide into a cell. Use of cloning vectors, expression 
vectors, adapters, and linkers is well known and extensively described in the art. 
For a description of such nucleic acids see, for example, Stratagene Cloning 
Systems, Catalogs 1995, 1996, 1997 (La Jolla, CA); and, Amersham Life 
Sciences, Inc. Catalog '97 (Arlington Heights, IL). 

The isolated nucleic acid compositions of this invention, such as RNA, 
cDNA, genomic DNA, or a hybrid thereof, can be obtained from plant biological 
sources using any number of cloning methodologies known to those of skill in the 
art. In some embodiments, oligonucleotide probes which selectively hybridize, 
under stringent conditions, to the polynucleotides of the present invention are 
used to identify the desired sequence in a cDNA or genomic DNA library. 
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Exemplary total RNA and mRNA isolation protocols are described in Plant 
Molecular Biology: A Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin 
(1997); and, Current Protocols in Molecular Biology, Ausubel, etai, Eds., Greene 
Publishing and Wiley-lnterscience, New York (1995). Total RNA and mRNA 
isolation kits are commercially available from vendors such as Stratagene (La 
Jolla, CA), Clonetech (Palo Alto, CA), Pharmacia (Piscataway, NJ), and 5*-3' 
(Paoli, PA). See also, U.S. Patent Nos. 5,614,391; and, 5,459,253. 

Typical cDNA synthesis protocols are well known to the skilled artisan and 
are described in such standard references as: Plant Molecular Biology: A 
Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin (1997); and, Current 
Protocols in Molecular Biology, Ausubel, et al., Eds., Greene Publishing and 
Wiley-lnterscience, New York (1995). cDNA synthesis kits are available from a 
variety of commercial vendors such as Stratagene or Pharmacia. 

An exemplary method of constructing a greater than 95% pure full-length 
cDNA library is described by Carninci et ai, Genomics, 37:327-336 (1996). Other 
methods for producing full-length libraries are known in the art. See, e.g., Edery 
etai, Mol. Cell Biol. 1 5(6):3363-3371 (1995); and PCT Application WO 96/34981. 

It is often convenient to normalize a cDNA library to create a library in 
which each clone is more equally represented. A number of approaches to 
normalize cDNA libraries are known in the art. Construction of normalized 
libraries is described in Ko, Nucl. Acids. Res. 18(19):5705-571 1 (1990); Patanjali 
etai., Proc. Natl. Acad. U.S.A. 88:1943-1947 (1991); U.S. Patents 5,482,685 and 
5,637,685; and Scares era/., Proc. Natl. Acad. Sci. USA 91:9228-9232 (1994). 

Subtracted cDNA libraries are another means to increase the proportion of 
less abundant cDNA species. See, Foote et al. in, Plant Molecular Biology: A 
Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin (1997); Kho and Zarbl, 
Technique 3(2):58-63 (1991); Sive and St. John, Nucl. Acids Res. 16(22): 10937 
(1988); Current Protocols in Molecular Biology, Ausubel, et al., Eds., Greene 
Publishing and Wiley-lnterscience, New York (1995); and, Swaroop et al., Nucl. 
Acids Res. 19(8): 1954 (1991). cDNA subtraction kits are commercially available. 
See, e.g., PCR-Select (Clontech). 

To construct genomic libraries, large segments of genomic DNA are 
generated by random fragmentation. Examples of appropriate molecular 
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biological techniques and instructions are found in Sambrook, et al., Molecular 
Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Vols. 1-3 
(1989), Methods in Enzymology, Vol. 152: Guide to Molecular Cloning 
Techniques, Berger and Kimmel, Eds., San Diego: Academic Press, Inc. (1987), 
Current Protocols in Molecular Biology, Ausubel, et al., Eds., Greene Publishing 
and Wiley-lnterscience, New York (1995); Plant Molecular Biology: A Laboratory 
Manual, Clark, Ed., Springer-Verlag, Berlin (1997). Kits for construction of 
genomic libraries are also commercially available. 

The cDNA or genomic library can be screened using a probe based upon 
the sequence of a nucleic acid of the present invention such as those disclosed 
herein. Probes may be used to hybridize with genomic DNA or cDNA sequences 
to isolate homologous polynucleotides in the same or different plant species. 
Those of skill in the art will appreciate that various degrees of stringency of 
hybridization can be employed in the assay; and either the hybridization or the 
wash medium can be stringent. The degree of stringency can be controlled by 
temperature, ionic strength, pH and the presence of a partially denaturing solvent 

such as formamide. 

Typically, stringent hybridization conditions will be those in which the salt 
concentration is less than about 1.5 M Na ion. typically about 0.01 to 1 .0 M Na ion 
concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least 
about 30 6 C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C 
for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also 
be achieved with the addition of destabilizing agents such as formamide. 

For purposes of defining the invention the following conditions are 
provided. Exemplary low stringency conditions include hybridization with a buffer 
solution of 30 to 35% formamide, 1 M NaCI, 1% SDS (sodium dodecyl sulfate) at 
37°C, and a wash in 1X to 2X SSC (20X SSC = 3.0 M NaCI/0.3 M trisodium 
citrate) at 50°C. Exemplary moderate stringency conditions include hybridization 
in 40 to 45% formamide, 1 M NaCI, 1 % SDS at 37»C, and a wash in 0.5X to 1 X 
SSC at 55°C. Exemplary high stringency conditions include hybridization in 50% 
formamide, 1 M NaCI, 1% SDS at 37'C, and a wash in 0.1X SSC at 60°C. 
Typically the time of hybridization is from 4 to 16 hours. 
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An extensive guide to the hybridization of nucleic acids is found in Tijssen, 
Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with 
Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and 
the strategy of nucleic acid probe assays", Elsevier, New York (1 993); and 
Current Protocols in Molecular Biology, Chapter 2, Ausubel, et a/.. Eds., Greene 
Publishing and Wiley-lnterscience, New York (1995). Often, cDNA libraries will 
be normalized to increase the representation of relatively rare cDNAs. 

The nucleic acids of the invention can be amplified from nucleic acid 
samples using amplification techniques. For instance, polymerase chain reaction 
(PCR) technology can be used to amplify the sequences of polynucleotides of the 
present invention and related polynucleotides directly from genomic DNA or 
cDNA libraries. PCR and other in vitro amplification methods may also be useful, 
for example, to clone nucleic acid sequences that code for proteins to be 
expressed, to make nucleic acids to use as probes for detecting the presence of 
the desired mRNA in samples, for nucleic acid sequencing, or for other purposes. 

Examples of techniques useful for in vitro amplification methods are found 
in Berger, Sambrook, and Ausubel, as well as Mullis et a/., U.S. Patent No. 
4,683,202 (1 987); and, PCR Protocols A Guide to Methods and Applications, Innis 
et a/., Eds., Academic Press Inc., San Diego, CA (1990). Commercially available 
kits for genomic PCR amplification are known in the art. See, e.g., Advantage-GC 
Genomic PCR Kit (Clontech). The T4 gene 32 protein (Boehringer Mannheim) 
can be used to improve yield of long PCR products. PCR-based screening 
methods have also been described. Wilfinger et al. describe a PCR-based 
method in which the longest cDNA is identified in the first step so that incomplete 
clones can be eliminated from study. BioTechniques, 22(3):481-486 (1997). 

In one aspect of the invention, nucleic acids can be amplified from a plant 
nucleic acid library. The nucleic acid library may be a cDNA library, a genomic 
library, or a library generally constructed from nuclear transcripts at any stage of 
intron processing. Libraries can be made from a variety of plant tissues. Good 
results have been obtained using mitotically active tissues such as shoot 
meristems, shoot meristem cultures, embryos, callus and suspension cultures, 
immature ears and tassels, and young seedlings. The cDNAs of the present 
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invention were obtained from immature zygotic embryo and regenerating callus 
libraries. 

Alternatively, the sequences of the invention can be used to isolate 
corresponding sequences in other organisms, particularly other plants, more 
particularly, other monocots. In this manner, methods such as PCR, 
hybridization, and the like can be used to identify such sequences having 
substantial sequence similarity to the sequences of the invention. See, for 
example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed., 
Cold Spring Harbor Laboratory Press, Plainview, New York), and Innis et al. 
0 (1990), PCR Protocols: A Guide to Methods and Applications (Academic Press, 
New York). Coding sequences isolated based on their sequence identity to the 
entire inventive coding sequences set forth herein or to fragments thereof are 
encompassed by the present invention. 

The isolated nucleic acids of the present invention can also be prepared by 
5 direct chemical synthesis by methods such as the phosphotriester method of 
Narang et al., Meth. Enzymol. 68.90-99 (1 979); the phosphodiester method of 
Brown era/., Meth. Enzymol. 68:109-151 (1979); the diethylphosphoramidite 
method of Beaucage et al., Tetra. Lett. 22:1859-1862 (1981); the solid phase 
phosphoramidite Wester method described by Beaucage and Caruthers, Tetra. 
20 Letts. 22(20):1859-1862 (1981), e.g., using an automated synthesizer, e.g., as 
described in Needham-VanDevanter etal. Nucleic Acids Res. 12:6159-6168 
(1984); and, the solid support method of U.S. Patent No. 4,458,066. Chemical 
synthesis generally produces a single stranded oligonucleotide. This may be 
converted into double stranded DNA by hybridization with a complementary 
25 sequence, or by polymerization with a DNA polymerase using the single strand as 
a template. One of skill will recognize that while chemical synthesis of DNA is 
limited to sequences of about 100 bases, longer sequences may be obtained by 
the ligation of shorter sequences. 

The nucleic acids of the present invention include those amplified using the 
30 following primer pairs: SEQ ID NOS: 3 and 4, 5 and 6, 9 and 10, or 11 and12or 
primers determined by using Vector nti Suite, InforMax Version 5. 
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EXPRESSION CASSETTES 

In another embodiment expression cassettes comprising isolated nucleic 
acids of the present invention are provided. An expression cassette will typically 
comprise a polynucleotide of the present invention operably linked to 
transcriptional initiation regulatory sequences which will direct the transcription of 
the polynucleotide in the intended host cell, such as tissues of a transformed 
plant. 

The construction of such expression cassettes which can be employed in 
conjunction with the present invention is well known to those of skill in the art in 
light of the present disclosure. See, e.g., Sambrook, et a/.; Molecular Cloning: A 
Laboratory Manual; Cold Spring Harbor, New York; (1 989); Gelvin, ef a/. ; Plant 
Molecular Biology Manual (1 990); Plant Biotechnology: Commercial Prospects 
and Problems, eds. Prakash, et a/.; Oxford & IBH Publishing Co.; New Delhi. 
India- (1993); and Heslot, et a/.; Molecular Biology and Genetic Engineering of 
Yeasts; CRC Press, Inc., USA; (1992); each incorporated herein in its entirety by 
reference. 

For example, plant expression vectors may include (1) a cloned plant gene 
under the transcriptional control of 5' and 3" regulatory sequences and (2) a 
dominant selectable marker. Such plant expression vectors may also contain, if 
desired a promoter regulatory region (e.g., one conferring inducible, constitute, 
environmentally- or deve.opmental.y-regu.ated, or cell- or tissue-specif.c/select.ve 
expression), a transcription initiation start site, a ribosome binding site, an RNA 
processing signal, a transcription termination site, and/or a polyadenylation 
signal. 

Constitutive, tissue-preferred or inducible promoters can be employed. 
Examples of constitutive promoters include the cauliflower mosaic virus (CaMV) 
35S transcription initiation region, the V- or 2'- promoter derived from T-DNA of 
Agrobaderium tumefaciens, the actin promoter, the ubiquitin promoter, the 
histone H2B promoter (Nakayama et a/., 1992. FEBS Lett 30:167-170), the Smas 
promoter, the cinnamyl alcohol dehydrogenase promoter (U.S. Patent No. 
5,683,439), the Nos promoter, the pEmu promoter, the rubisco promoter, the 
GRP1-8 promoter, and other transcription initiation regions from various plant 
genes known in the art. 
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Examples of inducible promoters are the Adh1 promoter which is inducible 
by hypoxia or cold stress, the Hsp70 promoter which is inducible by heat stress, 
the PPDK promoter which is inducible by light, the In2 promoter which is safener 
induced, the ERE promoter which is estrogen induced and the Pepcarboxylase 

5 promoter which is light induced. 

Examples of promoters under developmental control include promoters that 
initiate transcription preferentially in certain tissues, such as leaves, roots, fruit, 
seeds, or flowers. An exemplary promoter is the anther specific promoter 5126 
(U.S. Patent Nos. 5,689,049 and 5,689,051). Examples of seed-preferred 

10 promoters include, but are not limited to, 27 kD gamma zein promoter and waxy 
promoter, Boronat, A., Martinez, M.C.. Reina, M., Puigdomenech, P. and Palau, 
j • isolation and sequencing of a 28 kD glutelin-2 gene from maize: Common 
elements in the 5' flanking regions among zein and glutelin genes; Plant Sci. 
47:95-102 (1986) and Reina, M.. Ponte, I., Guillen, P., Boronat, A. and Palau, J.. 

15 Sequence analysis of a genomic clone encoding a Zc2 protein from Zea mays 
W64 A. Nucleic Acids Res. 18(21):6426 (1990). See the following site relating to 
the waxy promoter: Kloesgen, R.B., Gierl.A., Schwarz-Sommer, Z.S. and Saedler, 
H., Molecular analysis of the waxy locus of Zea mays, Mol. Gen. Genet. 203:237- 
244 (1986). The disclosures each of these are incorporated herein by reference 

20 in their entirety. 

Preferably a weak constitutive promoter, such as the Nos promoter, an 
inducible promoter, such as In2, or a nucellus-preferred or integument-preferred 
promoter are used to induce apospory. For example the barley or maize Nud 
promoter, the maize Cim 1 promoter or the maize LTP2 promoter can be used to 
25 preferentially express in the nucellus. See for example US Serial No. 60/097,233 
filed August 20, 1998 the disclosure of which is incorporated herein by reference. 

Either heterologous or non-heterologous (i.e., endogenous) promoters can 
be employed to direct expression of the nucleic acids of the present invention. 
These promoters can also be used, for example, in expression cassettes to drive 
30 expression of antisense nucleic acids to reduce, increase, or alter concentration 
and/or composition of the proteins of the present invention in a desired tissue. 

If polypeptide expression is desired, it is generally desirable to include a 
polyadenylation region at the 3--end of a polynucleotide coding region. The 
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polyadenylation region can be derived from the natural gene, from a variety of 
other plant genes, or from T-DNA. The 3' end sequence to be added can be 
derived from, for example, the nopaline synthase or octopine synthase genes, or 
alternatively from another plant gene, or less preferably from any other eukaryotic 
gene. 

An intron sequence can be added to the 5' untranslated region or the 
coding sequence of the partial coding sequence to increase the amount of the 
mature message that accumulates. See for example Buchman and Berg, Mol. 
Cell Biol. 8:4395-4405 (1988); Callis et a/., Genes Dev. 1 :1 183-1200 (1987). Use 
of maize introns Adh1-S intron 1, 2, and 6, the Bronze-1 intron are known in the 
art. See generally, The Maize Handbook, Chapter 1 16, Freeling and Walbot, 
Eds., Springer, New York (1994). 

The vector comprising the sequences from a polynucleotide of the present 
invention will typically comprise a marker gene which confers a selectable 
phenotype on plant cells. Usually, the selectable marker gene will encode 
antibiotic or herbicide resistance. Suitable genes include those coding for 
resistance to the antibiotics spectinomycin and streptomycin (e.g., the aada 
gene), the streptomycin phosphotransferase (SPT) gene coding for streptomycin 
resistance, the neomycin phosphotransferase (NPTII) gene encoding kanamycin 
or geneticin resistance, the hygromycin phosphotransferase (HPT) gene coding 
for hygromycin resistance. 

Suitable genes coding for resistance to herbicides include those which act 
to inhibit the action of acetolactate synthase (ALS), in particular the 
sulfonylurea-type herbicides (e.g., the acetolactate synthase (ALS) gene 
containing mutations leading to such resistance in particular the S4 and/or Hra 
mutations), those which act to inhibit action of glutamine synthase, such as 
phosphinothricin or basta (e.g., the bar gene), or other such genes known in the 
art. The bar gene encodes resistance to the herbicide basta and the ALS gene 
encodes resistance to the herbicide chlorsulfuron. While useful in conjunction 
with the above antibiotic and herbicide-resistance selective markers (i.e. use of 
the LEC1 gene can increase transformation frequencies when using chemical 
selection), a prefered use of LEC1 expression takes advantage of this gene 
conferring a growth advantage to transformed cells without the need for inhibitory 
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compounds to retard non-transformed growth. Thus, LEC1 transformants are 
recovered based solely on their differential growth advantage. 

Typical vectors useful for expression of genes in higher plants are well 
known in the art and include vectors derived from the tumor-inducing (Ti) plasmid 
of Agrobacterium tumefaciens described by Rogers era/., Meth. In Enzymol. 
153:253-277 (1987). Exemplary A. tumefaciens vectors useful herein are 
plasmids pKYLXS and pKYLX7 of Schardl ef a/., Gene, 61:1-1 1 (1987) and Berger 
et ai, Proc. Natl. Acad. Sci. USA 86:8402-8406 (1989). Another useful vector 
herein is plasmid pBI101.2 that is available from Clontech Laboratories, Inc. (Palo 
Alto, CA). 

A variety of plant viruses that can be employed as vectors are known in the 
art and include cauliflower mosaic virus (CaMV), geminivirus, brome mosaic virus, 
and tobacco mosaic virus. 

A polynucleotide of the present invention can be expressed in either sense 
or anti-sense orientation as desired. In plant cells, it has been shown that 
antisense RNA inhibits gene expression by preventing the accumulation of mRNA 
which encodes the enzyme of interest, see, e.g., Sheehy ef a/., Proc. Natl. Acad. 
Sci. USA 85:8805-8809 (1988); and Hiatt ef a/., U.S. Patent No. 4,801,340. 

Another method of suppression is sense suppression. Introduction of 
nucleic acid configured in the sense orientation has been shown to be an 
effective means by which to block the transcription of target genes. For an 
example of the use of this method to modulate expression of endogenous genes 
see, Napoli ef a/., The Plant Cell 2:279-289 (1990) and U.S. Patent No. 
5,034,323. 

Recent work has shown suppression with the use of double stranded RNA. 
Such work is described in Tabara et ai, Science 282:5388:430-431 (1998). 

Catalytic RNA molecules or ribozymes can also be used to inhibit 
expression of plant genes. The inclusion of ribozyme sequences within antisense 
RNAs confers RNA-cleaving activity upon them, thereby increasing the activity of 
the constructs. The design and use of target RNA-specific ribozymes is 
described in Haseloff ef a/., Nature 334:585-591 (1988). 

A variety of cross-linking agents, alkylating agents and radical generating 
species as pendant groups on polynucleotides of the present invention can be 
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used to bind, label, detect, and/or cleave nucleic acids. For example, Vlassov, V. 
V., et ai, Nucleic Acids Res (1986) 14:4065^076, describe covalent bonding of a 
single-stranded DNA fragment with alkylating derivatives of nucleotides 
complementary to target sequences. A report of similar work by the same group 
is that by Knorre, D. G., et ai, Biochimie (1985) 67:785-789. Iverson and Dervan 
also showed sequence-specific cleavage of single-stranded DNA mediated by 
incorporation of a modified nucleotide which was capable of activating cleavage 
(J. Am. Chem. Soc. (1987) 109:1241-1243). Meyer, R. B., etal., J. Am. Chem. 
Soc. (1 989) 1 1 1 :851 7-851 9, effect covalent crosslinking to a target nucleotide 
using an alkylating agent complementary to the single-stranded target nucleotide 
sequence. A photoactivated crosslinking to single-stranded oligonucleotides 
mediated by psoralen was disclosed by Lee, B. L, et ai, Biochemistry (1988) 
27:31 97-3203. Use of crosslinking in triple-helix forming probes was also 
disclosed by Home, etal., J. Am. Chem. Soc. (1990) 112:2435-2437. Use of N4, 
N4-ethanocytosine as an alkylating agent to crosslink to single-stranded 
oligonucleotides has also been described by Webb and Matteucci, J. Am. Chem. 
Soc. (1986) 108:2764-2765; Nucleic Acids Res (1986) 14:7661-7674; Feteritz et 
ai, J. Am. Chem. Soc. 113:4000 (1991). Various compounds to bind, detect, 
label, and/or cleave nucleic acids are known in the art. See, for example, U.S. 
Patent Nos. 5,543,507; 5,672,593; 5,484,908; 5,256,648; and, 5,681941. 

Proteins 

In another aspect, the invention relates to an isolated protein comprising a 
member selected from the group consisting of: 

(a) a polypeptide comprising at least 25 contiguous amino acids of SEQ 
ID NO: 2, 8, 10, 12, 14, 16, 18, 20, or 22; 

(b) a polypeptide which is a plant HAP3-type CCATT-box binding 
transcriptional activator that regulates gene expression during 
embryo development and maturation; 

(c) a polypeptide comprising at least 60% sequence identity to SEQ ID 
NO: 2, 12, 14, 16, 20, or 22, or 70 % sequence identity to SEQ ID NO: 
8, 10, or 18, wherein the % sequence identity is based on the entire 
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sequence and is determined by GAP analysis using Gap Weight of 12 
and Length Weight of 4; 

(d) a polypeptide encoded by a nucleic acid of claim 1 ; 

(e) a polypeptide encoded by a nucleic acid of SEQ ID NO: 1 , 7, 9, 1 1 , 
13, 15, 17, 19, or 21; and 

(f) a polypeptide having the sequence set forth in SEQ ID NO: 2, 8, 10, 
12, 14, 16, 18, 20, or 22. 



Proteins of the present invention include proteins derived from the native 
1 0 protein by deletion (so-called truncation), addition or substitution of one or more 
amino acids at one or more sites in the native protein. Such variants may result 
from, for example, genetic polymorphism or from human manipulation. Methods 
for such manipulations are generally known in the art. 

For example, amino acid sequence variants of the polypeptide can be 
1 5 prepared by mutations in the cloned DNA sequence encoding the native protein 
of interest. Methods for mutagenesis and nucleotide sequence alterations are 
well known in the art. See, for example, Walker and Gaastra, eds. (1983) 
Techniques in Molecular Biology (MacMillan Publishing Company, New York); 
Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488^92; Kunkel etal. (1987) 
20 Methods Enzymol. 1 54:367-382; Sambrook et al. (1 989) Molecular Cloning: A 
Laboratory Manual (Cold Spring Harbor, New York); U.S. Patent No. 4,873,192; 
and the references cited therein; herein incorporated by reference. Guidance as 
to appropriate amino acid substitutions that do not affect biological activity of the 
protein of interest may be found in the model of Dayhoff et al. (1978) Atlas of 

25 Protein Sequence and 

Structure (Natl. Biomed. Res. Found., Washington, D.C.), herein incorporated by 
reference. Conservative substitutions, such as exchanging one amino acid with 
another having similar properties, may be preferred. 

In constructing variants of the proteins of interest, modifications to the 

30 nucleotide sequences encoding the variants will be made such that variants 
continue to possess the desired activity. Obviously, any mutations made in the 
DNA encoding the variant protein must not place the sequence out of reading 
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frame and preferably will not create complementary regions that could produce 
secondary mRNA structure. See EP Patent Application Publication No. 75,444. 

The isolated proteins of the present invention include a polypeptide 
comprising at least 23 contiguous amino acids encoded by any one of the nucleic 
5 acids of the present invention, or polypeptides which are conservatively modified 
variants thereof. The proteins of the present invention or variants thereof can 
comprise any number of contiguous amino acid residues from a polypeptide of the 
present invention, wherein that number is selected from the group of integers 
consisting of from 23 to the number of residues in a full-length polypeptide of the 
10 present invention. Optionally, this subsequence of contiguous amino acids is at 
least 25, 30, 35, or 40 amino acids in length, often at least 50, 60, 70, 80, or 90 
amino acids in length. 

The present invention includes catalytically active polypeptides (i.e., 
enzymes). Catalytically active polypeptides will generally have a specific activity 
15 of at least 20%, 30%, or 40%, and preferably at least 50%, 60%, or 70%, and 
most preferably at least 80%, 90%, or 95% that of the native (non-synthetic), 
endogenous polypeptide. Further, the substrate specificity (KJK^) is optionally 
substantially similar to the native (non-synthetic), endogenous polypeptide. 
Typically, the K„, will be at least 30%, 40%, or 50%, that of the native (non- 
20 synthetic), endogenous polypeptide; and more preferably at least 60%, 70%, 
80%, or 90%. Methods of assaying and quantifying measures of enzymatic 
activity and substrate specificity (kc^), are well known to those of skill in the 
art. 

The present invention includes modifications that can be made to an 
25 inventive protein. In particular, it may be desirable to diminish the activity of the 
LEC1 gene. Other modifications may be made to facilitate the cloning, 
expression, or incorporation of the targeting molecule into a fusion protein. Such 
modifications are well known to those of skill in the art and include, for example, a 
methionine added at the amino terminus to provide an initiation site, or additional 
30 amino acids (e.g., poly His) placed on either terminus to create conveniently 
located restriction sites or termination codons or purification sequences. 

Using the nucleic acids of the present invention, one may express a protein 
of the present invention in a recombinantly engineered cell such as bacteria, 
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yeast, insect, mammalian, or preferably plant cells. The cells produce the protein 
in a non-natural condition (e.g., in quantity, composition, location, and/or time), 
because they have been genetically altered through human intervention to do so. 

Typically, an intermediate host cell will be used in the practice of this 
invention to increase the copy number of the cloning vector. With an increased 
copy number, the vector containing the gene of interest can be isolated in 
significant quantities for introduction into the desired plant cells. 

Host cells that can be used in the practice of this invention include 
prokaryotes, including bacterial hosts such as Eschericia coli. Salmonella 
typhimurium, and Serratia marcescens. Eukaryotic hosts such as yeast or 
filamentous fungi may also be used in this invention. Since these hosts are also 
microorganisms, it will be essential to ensure that plant promoters which do not 
cause expression of the polypeptide in bacteria are used in the vector. 

Commonly used prokaryotic control sequences include such commonly 
used promoters as the beta lactamase (penicillinase) and lactose (lac) promoter 
systems (Chang et a/., Nature 198:1056 (1977)), the tryptophan (trp) promoter 
system (Goeddel et a/., Nucleic Acids Res. 8:4057 (1980)) and the lambda 
derived P L promoter and N-gene ribosome binding site (Shimatake et a/., Nature 
292:128 (1981)). The inclusion of selection markers in DNA vectors transfected 
in E. coli is also useful. Examples of such markers include genes specifying 
resistance to ampicillin, tetracycline, or chloramphenicol. 

The vector is selected to allow introduction into the appropriate host cell. 
Bacterial vectors are typically of plasmid or phage origin. Expression systems for 
expressing a protein of the present invention are available using Bacillus sp. and 
Salmonella (Palva, etal., Gene 22:229-235 (1983); Mosbach, etal., Nature 
302:543-545(1983)). 

Synthesis of heterologous proteins in yeast is well known. See Sherman, 
F., et al.. Methods in Yeast Genetics, Cold Spring Harbor Laboratory (1982). Two 
widely utilized yeast for production of eukaryotic proteins are Saccharomyces 
cerevisiae and Pichia pastoris. Vectors, strains, and protocols for expression in 
Saccharomyces and Pichia are known in the art and available from commercial 
suppliers (e.g., Invitrogen). Suitable vectors usually have expression control 
sequences, such as promoters, including 3-phosphoglycerate kinase or alcohol 
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oxidase, and an origin of replication, termination sequences and the like as 
desired. 

A protein of the present invention, once expressed, can be isolated from 
yeast by lysing the cells and applying standard protein isolation techniques to the 
lysates. The monitoring of the purification process can be accomplished by using 
Western blot techniques or radioimmunoassay of other standard immunoassay 
techniques. 

The proteins of the present invention can also be constructed using non- 
cellular synthetic methods. Solid phase synthesis of proteins of less than about 
50 amino acids in length may be accomplished by attaching the C-terminal amino 
acid of the sequence to an insoluble support followed by sequential addition of 
the remaining amino acids in the sequence. Techniques for solid phase synthesis 
are described by Barany and Merrifield, Solid-Phase Peptide Synthesis, pp. 3-284 
in The Peptides: Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide 
Synthesis, Part A.; Merrifield, et al., J. Am. Chem. Soc. 85:2149-2156 (1963), and 
Stewart et al., Solid Phase Peptide Synthesis, 2nd ed., Pierce Chem. Co., 
Rockford, III. (1984). Proteins of greater length may be synthesized by 
condensation of the amino and carboxy termini of shorter fragments. Methods of 
forming peptide bonds by activation of a carboxy terminal end (e.g., by the use of 
the coupling reagent N,N'-dicycylohexylcarbodiimide)) is known to those of skill. 

The proteins of this invention, recombinant or synthetic, may be purified to 
substantial purity by standard techniques well known in the art, including 
detergent solubilization, selective precipitation with such substances as 
ammonium sulfate, column chromatography, immunopurification methods, and 
others. See, for instance, R. Scopes, Protein Purification: Principles and Practice, 
Springer-Verlag: New York (1982); Deutscher, Guide to Protein Purification, 
Academic Press (1990). For example, antibodies may be raised to the proteins 
as described herein. Purification from E. coli can be achieved following 
procedures described in U.S. Patent No. 4,51 1 ,503. Detection of the expressed 
protein is achieved by methods known in the art and include, for example, 
radioimmunoassays, Western blotting techniques or immunoprecipitation. 

The present invention further provides a method for modulating (i.e., 
increasing or decreasing) the concentration or composition of the polypeptides of 
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the present invention in a plant or part thereof. Modulation can be effected by 
increasing or decreasing the concentration and/or the composition (i.e., the ratio 
of the polypeptides of the present invention) in a plant. 

The method comprises transforming a plant cell with an expression 
5 cassette comprising a polynucleotide of the present invention to obtain a 
transformed plant cell, growing the transformed plant cell under conditions 
allowing expression of the polynucleotide in the plant cell in an amount sufficient 
to modulate concentration and/or composition in the plant cell. 

In some embodiments, the content and/or composition of polypeptides of 

1 0 the present invention in a plant may be modulated by altering, in vivo or in vitro, 
the promoter of a non-isolated gene of the present invention to up- or down- 
regulate gene expression. In some embodiments, the coding regions of native 
genes of the present invention can be altered via substitution, addition, insertion, 
or deletion to decrease activity of the encoded enzyme. See, e.g., Kmiec, U.S. 

1 5 Patent 5,565,350; Zarling ef a/., PCT/US93/03868. One method of down- 
regulation of the protein involves using PEST sequences that provide a target for 
degradation of the protein. It has been observed that high levels of LEC1 prevent 
germination. See Lotan etal., Cell 1998 June 26; 93(7): 11 95-1 205. Thus, 
temporal regulation of LEC1 expression may be desirable in certain species to 

20 permit proper germination, vegetative growth, flowering and reproduction. 

In some embodiments, an isolated nucleic acid (e.g., a vector) comprising 
a promoter sequence is transfected into a plant cell. Subsequently, a plant cell 
comprising the promoter operably linked to a polynucleotide of the present 
invention is selected for by means known to those of skill in the art such as, but 

25 not limited to, Southern blot, DNA sequencing, or PCR analysis using primers 
specific to the promoter and to the gene and detecting ampl icons produced 
therefrom. A plant or plant part altered or modified by the foregoing embodiments 
is grown under plant forming conditions for a time sufficient to modulate the 
concentration and/or composition of polypeptides of the present invention in the 

30 plant. Plant forming conditions are well known in the art. 

In general, concentration or composition is increased or decreased by at 
least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% relative to a 
native control plant, plant part, or cell lacking the aforementioned expression 
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cassette. Modulation in the present invention may occur during and/or 
subsequent to growth of the plant to the desired stage of development. 
Modulating nucleic acid expression temporally and/or in particular tissues can be 
controlled by employing the appropriate promoter operably linked to a 
5 polynucleotide of the present invention in, for example, sense or antisense 
orientation as discussed in greater detail, supra. Induction of expression of a 
polynucleotide of the present invention can also be controlled by exogenous 
administration of an effective amount of inducing compound. Inducible promoters 
and inducing compounds which activate expression from these promoters are well 

1 0 known in the art. In preferred embodiments, the polypeptides of the present 
invention are modulated in monocots or dicots, preferably maize, soybeans, 
sunflower, sorghum, canola, wheat, alfalfa, rice, barley and millet. 

Means of detecting the proteins of the present invention are not critical 
aspects of the present invention. In a preferred embodiment, the proteins are 

1 5 detected and/or quantified using any of a number of well recognized 

immunological binding assays (see, e.g., U.S. Patents 4,366,241; 4,376,110; 
4,517,288; and 4,837,168). For a review of the general immunoassays, see also 
Methods in Cell Biology, Vol. 37: Antibodies in Cell Biology, Asai, Ed., Academic 
Press, Inc. New York (1993); Basic and Clinical Immunology 7th Edition, Stites & 

20 Terr, Eds. (1 991 ). Moreover, the immunoassays of the present invention can be 
performed in any of several configurations, e.g., those reviewed in Enzyme 
Immunoassay, Maggio, Ed., CRC Press, Boca Raton, Florida (1980); Tijan, 
Practice and Theory of Enzyme Immunoassays, Laboratory Techniques in 
Biochemistry and Molecular Biology, Elsevier Science Publishers B.V., 

25 Amsterdam (1985); Harlow and Lane, supra; Immunoassay: A Practical Guide, 
Chan, Ed., Academic Press, Orlando, FL (1987); Principles and Practice of 
Immunoassays, Price and Newman Eds., Stockton Press, NY (1991); and Non- 
isotopic Immunoassays, Ngo, Ed., Plenum Press, NY (1988). 

Typical methods include Western blot (immunoblot) analysis, analytic 

30 biochemical methods such as electrophoresis, capillary electrophoresis, high 
performance liquid chromatography (HPLC), thin layer chromatography (TLC), 
hyperdiffusion chromatography, and the like, and various immunological methods 
such as fluid or gel precipitin reactions, immunodiffusion (single or double), 
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immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked 
immunosorbent assays (ELISAs), immunofiuorescent assays, and the like. 

Non-radioactive labels are often attached by indirect means. Generally, a 
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then 
5 binds to an anti-ligand (e.g., streptavidin) molecule which is either inherently 
detectable or covalently bound to a signal system, such as a detectable enzyme, 
a fluorescent compound, or a chemiluminescent compound. A number of ligands 
and anti-ligands can be used. Where a ligand has a natural anti-ligand, for 
example, biotin, thyroxine, and Cortisol, it can be used in conjunction with the 

10 labeled, naturally occurring anti-ligands. Alternatively, any haptenic or antigenic 
compound can be used in combination with an antibody. 

The molecules can also be conjugated directly to signal generating 
compounds, e.g., by conjugation with an enzyme or fluorophore. Enzymes of 
interest as labels will primarily be hydrolases, particularly phosphatases, 

15 esterases and glycosidases, or oxidoreductases, particularly peroxidases. 

Fluorescent compounds include fluorescein and its derivatives, rhodamine and its 
derivatives, dansyl, umbelliferone, etc. Chemiluminescent compounds include 
luciferin, and 2,3-dihydrophthalazinediones, e.g., luminol. For a review of various 
labeling or signal producing systems which may be used, see, U.S. Patent No. 

20 4,391 ,904, which is incorporated herein by reference. 

Some assay formats do not require the use of labeled components. For 
instance, agglutination assays can be used to detect the presence of the target 
antibodies. In this case, antigen-coated particles are agglutinated by samples 
comprising the target antibodies. In this format, none of the components need be 

25 labeled and the presence of the target antibody is detected by simple visual 
inspection. 

The proteins of the present invention can be used for identifying 
compounds that bind to (e.g., substrates), and/or increase or decrease (i.e., 
modulate) the enzymatic activity of, catalytically active polypeptides of the present 
30 invention. The method comprises contacting a polypeptide of the present 

invention with a compound whose ability to bind to or modulate enzyme activity is 
to be determined. The polypeptide employed will have at least 20%, preferably at 
least 30% or 40%, more preferably at least 50% or 60%, and most preferably at 



WO 00/28058 PCT/US99/26514 

-29- 

least 70% or 80% of the specific activity of the native, full-length polypeptide of 
the present invention (e.g., enzyme). Methods of measuring enzyme kinetics are 
well known in the art. See, e.g., Segel, Biochemical Calculations, 2 nd ed., John 
Wiley and Sons, New York (1976). 
5 Antibodies can be raised to a protein of the present invention, including 

individual, allelic, strain, or species variants, and fragments thereof, both in their 
naturally occurring (full-length) forms and in recombinant forms. Additionally, 
antibodies are raised to these proteins in either their native configurations or in 
non-native configurations. Anti-idiotypic antibodies can also be generated. Many 

1 0 methods of making antibodies are known to persons of skill. 

In some instances, it is desirable to prepare monoclonal antibodies from 
various mammalian hosts, such as mice, rodents, primates, humans, etc. 
Description of techniques for preparing such monoclonal antibodies are found in, 
e.g., Basic and Clinical Immunology, 4th ed., Stites et al., Eds., Lange Medical 

15 Publications, Los Altos, CA, and references cited therein; Harlow and Lane, 
Supra; Goding, Monoclonal Antibodies: Principles and Practice, 2nded., 
Academic Press, New York, NY (1986); and Kohler and Milstein, Nature 256:495- 
497 (1975). 

Other suitable techniques involve selection of libraries of recombinant 
20 antibodies in phage or similar vectors (see, e.g., Huse et al., Science 246:1 275- 
1281 (1989); and Ward, etal., Nature 341:544-546 (1989); and Vaughan et al., 
Nature Biotechnology, 14:309-314 (1996)). Alternatively, high avidity human 
monoclonal antibodies can be obtained from transgenic mice comprising 
fragments of the unrearranged human heavy and light chain Ig loci (i.e., minilocus 
25 transgenic mice). Fishwild et al., Nature Biotech., 14:845-851 (1996). Also, 
recombinant immunoglobulins may be produced. See, Cabilly, U.S. Patent No. 
4,816,567; and Queen etal., Proc. Natl. Acad. Sci. 86:10029-10033 (1989). 

The antibodies of this invention can be used for affinity chromatography in 
isolating proteins of the present invention, for screening expression libraries for 
30 particular expression products such as normal or abnormal protein or for raising 
anti-idiotypic antibodies which are useful for detecting or diagnosing various 
pathological conditions related to the presence of the respective antigens. 
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Frequently, the proteins and antibodies of the present invention will be 
labeled by joining, either covalently or non-covalently, a substance which 
provides for a detectable signal. A wide variety of labels and conjugation 
techniques are known and are reported extensively in both the scientific and 
5 patent literature. Suitable labels include radionuclides, enzymes, substrates, 
cofactors, inhibitors, fluorescent moieties, chemiluminescent moieties, magnetic 
particles, and the like. 

Transfection/Transformation of Cells 

10 The method of transformation/transfection is not critical to the instant 

invention; various methods of transformation or transfection are currently 
available. As newer methods are available to transform crops or other host cells 
they may be directly applied. Accordingly, a wide variety of methods have been 
developed to insert a DNA sequence into the genome of a host cell to obtain the 

15 transcription and/or translation of the sequence to effect phenotypic changes in 
the organism. Thus, any method which provides for efficient 
transformation/transfection may be employed. 

A DNA sequence coding for the desired polynucleotide of the present 
invention, for example a cDNA or a genomic sequence encoding a full length 

20 protein, can be used to construct an expression cassette which can be introduced 
into the desired plant. Isolated nucleic acid acids of the present invention can be 
introduced into plants according techniques known in the art. Generally, 
expression cassettes as described above and suitable for transformation of plant 
cells are prepared. 

25 Techniques for transforming a wide variety of higher plant species are well 

known and described in the technical, scientific, and patent literature. See, for 
example, Weising et a/., Ann. Rev. Genet 22:421-477 (1988). For example, the 
DNA construct may be introduced directly into the genomic DNA of the plant cell 
using techniques such as electroporation, PEG poration, particle bombardment, 

30 silicon fiber delivery, or microinjection of plant cell protoplasts or embryogenic 
callus. See, e.g., Tomes, et a/., Direct DNA Transfer into Intact Plant Cells Via 
Microprojectile Bombardment, pp. 197-21 3 in Plant Cell, Tissue and Organ 
Culture, Fundamental Methods, eds. O. L Gamborg and G.C. Phillips. Springer- 
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Verlag Berlin Heidelberg New York, 1995. Alternatively, the DNA constructs may 
be combined with suitable T-DNA flanking regions and introduced into a 
conventional Agrobactehum tumefaciens host vector. The virulence functions of 
the Agrobactehum tumefaciens host will direct the insertion of the construct and 
5 adjacent marker into the plant cell DNA when the cell is infected by the bacteria. 
See, U.S. Patent No. 5,591,616. 

The introduction of DNA constructs using polyethylene glycol precipitation 
is described in Paszkowski et a/., Embo J. 3:2717-2722 (1984). Electroporation 
techniques are described in Fromm et a/., Proc. Natl. Acad, ScL 82:5824 (1985). 
10 Ballistic transformation techniques are described in Klein et al., Nature 327:70-73 

(1987) . 

Agrobactehum tumefaciens-medltaied transformation techniques are well 
described in the scientific literature. See, for example Horsch et a/ M Science 
233:496-498 (1984), and Fraley etai, Proc. Natl. Acad. Sci. 80:4803 (1983). For 

15 instance, Agrobactehum transformation of maize is described in WO 98/32326. 
Agrobactehum transformation of soybean is described in US Pat. No. 5,563,055. 

Other methods of transfection or transformation include (1 ) Agrobactehum 
rhizogenes-mediated transformation (see, e.g., Lichtenstein and Fuller In: Genetic 
Engineering, Vol. 6, PWJ Rigby, Ed. , London, Academic Press, 1987; and 

20 Lichtenstein, C. P., and Draper, J,. In: DNA Cloning, Vol. II, D. M. Glover, Ed., 
Oxford, IRI Press, 1985), Application PCT/US87/02512 (WO 88/02405 published 
Apr. 7, 1988) describes the use of A. rhizogenes strain A4 and its Ri plasmid 
along with A. tumefaciens vectors pARC8 or pARC16 (2) liposome-mediated DNA 
uptake (see, e.g., Freeman e/a/., Plant Cell Physiol. 25:1353, (1984)), (3) the 

25 vortexing method (see, e.g., Kindle, Proc. Natl. Acad. Sci. USA 87:1228, (1990)). 

DNA can also be introduced into plants by direct DNA transfer into pollen 
as described by Zhou ef a/., Methods in Enzymology, 101:433 (1983); D. Hess, 
Intern Rev. Cytol., 107:367 (1987); Luo etai, Plane Mol. Biol. Reporter, 6:165 

(1988) . Expression of polypeptide coding polynucleotides can be obtained by 
30 injection of the DNA into reproductive organs of a plant as described by Pena et 

a/., Nature, 325:274 (1987). DNA can also be injected directly into the cells of 
immature embryos and the rehydration of desiccated embryos as described by 
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Neuhaus et a/., Theor. Appl. Genet., 75:30 (1987); and Benbrook et a/., in 
Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp. 27-54 (1986). 

Animal and lower eukaryotic (e.g., yeast) host cells are competent or 
rendered competent for transfection by various means. There are several well- 
5 known methods of introducing DNA into animal cells. These include; calcium 
phosphate precipitation, fusion of the recipient cells with bacterial protoplasts 
containing the DNA t treatment of the recipient cells with liposomes containing the 
DNA, DEAE dextran, electroporation, biolistics, and micro-injection of the DNA 
directly into the cells. The transfected cells are cultured by means well known in 
10 the art. Kuchler, RJ., Biochemical Methods in Cell Culture and Virology, Dowden, 
Hutchinson and Ross, Inc. (1977). 

Altering the culture medium to suppress somatic embryogenesis in non- 
transformed plant cells and/or tissues to provide for a positive section 
1 5 means of transformed plant cells 

Using the following methods for controlling somatic embryogenesis, it is 
possible to alter plant tissue culture media components to suppress somatic 
embryogenesis in a plant species of interest (often having multiple components 
that potentially could be adjusted to impart this effect). Such conditions would 

20 not impart a negative or toxic in vitro environment for wild-type tissue, but instead 
would simply not produce a somatic embryogenic growth form. Introducing a 
transgene such as LEC1 will stimulate somatic embryogenesis and growth in the 
transformed cells or tissue, providing a clear differential growth screen useful for 
identifying transformants. 

25 Altering a wide variety of media components can modulate somatic 

embryogenesis (either stimulating or suppressing embryogenesis depending on 
the species and particular media component). Examples of media components 
which, when altered, can stimulate or suppress somatic embryogenesis include; 

1 ) the basal medium itself (macronutrient, micronutrients and vitamins; see T. A. 
30 Thorpe, 1981 for review, "Plant Tissue Culture: Methods and Applications in 

Agriculture", Academic Press, NY), 

2) plant phytohormones such as auxins (indole acetic acid, indole butyric acid, 
2,4-dichlorophenoxyacetic acid, naphthaleneacetic acid, picloram, dicamba 
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and other functional analogues), cytokinins (zeatin, kinetin. benzyl amino 
purine, 2-isopentyl adenine and functionally-related compounds) abscisic acid, 
adenine, and gibberellic acid, 

3) and other compounds that exert "growth regulator" effects such as coconut 
5 water, casein hydrolysate, and proline, and 

4) the type and concentration of gelling agent, pH and sucrose concentration. 
Changes in the individual components listed above (or in some cases 

combinations of components) have been demonstrated in the literature to 
modulate in vitro somatic embryogenesis across a wide range of dicotyledonous 
10 and monocotyledonous species. For a compilation of examples, see E.F. George 
et al. 1987. Plant Tissue Culture Media. Vol. 1: Formulations and Uses. 
Exergetics, Ltd., Publ., Edington, England. 

Transgenic Plant Regeneration 

1 5 Transformed plant cells which are derived by any of the above 

transformation techniques can be cultured to regenerate a whole plant which 
possesses the transformed genotype. Such regeneration techniques often rely on 
manipulation of certain phytohormones in a tissue culture growth medium, 
typically relying on a biocide and/or herbicide marker that has been introduced 

20 together with a polynucleotide of the present invention. For transformation and 
regeneration of maize see, Gordon-Kamm et al., The Plant Cell, 2:603-618 
(1990). 

Plants cells transformed with a plant expression vector can be 
regenerated, e.g., from single cells, callus tissue or leaf discs according to 

25 standard plant tissue culture techniques. It is well known in the art that various 
cells, tissues, and organs from almost any plant can be successfully cultured to 
regenerate an entire plant. Plant regeneration from cultured protoplasts is 
described in Evans et al., Protoplasts Isolation and Culture, Handbook of Plant 
Cell Culture, Macmillan Publishing Company, New York, pp. 124-176 (1983); and 

30 Binding, Regeneration of Plants, Plant Protoplasts, CRC Press, Boca Raton, pp. 
21-73(1985). 

The regeneration of plants containing the foreign gene introduced by 
Agrobacterium can be achieved as described by Horsch et al., Science, 



WO 00/28058 PCT/US99/26514 

-34- 

227:1229-1231 (1985) and Fraley et a/., Proa Natl. Acad. Set. U.S.A. 80:4803 
(1983). This procedure typically produces shoots within two to four weeks and 
these transformant shoots are then transferred to an appropriate root-inducing 
medium containing the selective agent and an antibiotic to prevent bacterial 
5 growth. Transgenic plants of the present invention may be fertile or sterile. 

Regeneration can also be obtained from plant callus, explants, organs, or 
parts thereof. Such regeneration techniques are described generally in Klee et 
a/., Ann. Rev. of Plant Phys. 38:467-486 (1987). The regeneration of plants from 
either single plant protoplasts or various explants is well known in the art. See, 

1 0 for example, Methods for Plant Molecular Biology, A. Weissbach and H. 

Weissbach, eds., Academic Press, Inc., San Diego, Calif. (1988). For maize cell 
culture and regeneration see generally, The Maize Handbook, Freeling and 
Walbot, Eds., Springer, New York (1994); Corn and Corn Improvement 3 rd 
edition, Sprague and Dudley Eds., American Society of Agronomy, Madison, 

15 Wisconsin (1988). 

One of skill will recognize that after the expression cassette is stably 
incorporated in transgenic plants and confirmed to be operable, it can be 
introduced into other plants by sexual crossing. Any of a number of standard 
breeding techniques can be used, depending upon the species to be crossed. 

20 In vegetatively propagated crops, mature transgenic plants can be 

propagated by the taking of cuttings, via production of apomictic seed, or by 
tissue culture techniques to produce multiple identical plants. Selection of 
desirable transgenics is made and new varieties are obtained and propagated 
vegetatively for commercial use. In seed propagated crops, mature transgenic 

25 plants can be self crossed to produce a homozygous inbred plant. The inbred 
plant produces seed containing the newly introduced heterologous nucleic acid. 
These seeds can be grown to produce plants that would produce the selected 
phenotype. 

Parts obtained from the regenerated plant, such as flowers, seeds, leaves, 
30 branches, fruit, and the like are included in the invention, provided that these 

parts comprise cells comprising the isolated nucleic acid of the present invention. 
Progeny and variants, and mutants of the regenerated plants are also included 
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within the scope of the invention, provided that these parts comprise the 
introduced nucleic acid sequences. 

Transgenic plants expressing a selectable marker can be screened for 
transmission of the nucleic acid of the present invention by, for example, standard 
5 immunoblot and DNA detection techniques. Transgenic lines are also typically 
evaluated on levels of expression of the heterologous nucleic acid. Expression at 
the RNA level can be determined initially to identify and quantitate expression- 
positive plants. Standard techniques for RNA analysis can be employed and 
include PCR amplification assays using oligonucleotide primers designed to 

10 amplify only the heterologous RNA templates and solution hybridization assays 
using heterologous nucleic acid-specific probes. The RNA-positive plants can 
then analyzed for protein expression by Western immunoblot analysis using the 
specifically reactive antibodies of the present invention. In addition, in situ 
hybridization and immunocytochemistry according to standard protocols can be 

1 5 done using heterologous nucleic acid specific polynucleotide probes and 

antibodies, respectively, to localize sites of expression within transgenic tissue. 
Generally, a number of transgenic lines are usually screened for the incorporated 
nucleic acid to identify and select plants with the most appropriate expression 
profiles. 

20 A preferred embodiment is a transgenic plant that is homozygous for the 

added heterologous nucleic acid; i.e., a transgenic plant that contains two added 
nucleic acid sequences, one gene at the same locus on each chromosome of a 
chromosome pair. A homozygous transgenic plant can be obtained by sexually 
mating (selfing) a heterozygous transgenic plant that contains a single added 

25 heterologous nucleic acid, germinating some of the seed produced and analyzing 
the resulting plants produced for altered expression of a polynucleotide of the 
present invention relative to a control plant (i.e., native, non-transgenic). Back- 
crossing to a parental plant and out-crossing with a non- transgenic plant are also 
contemplated. Alternatively, propagation of heterozygous transgenic plants could 

30 be accomplished through apomixis. 

The present invention provides a method of genotyping a plant comprising 
a polynucleotide of the present invention. Genotyping provides a means of 
distinguishing homologs of a chromosome pair and can be used to differentiate 
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segregants in a plant population. Molecular marker methods can be used for 
phylogenetic studies, characterizing genetic relationships among crop varieties, 
identifying crosses or somatic hybrids, localizing chromosomal segments affecting 
monogenic traits, map based cloning, and the study of quantitative inheritance. 
5 See, e.g., Plant Molecular Biology: A Laboratory Manual, Chapter 7, Clark, Ed., 
Springer-Verlag, Berlin (1997). For molecular marker methods, see generally, 
The DNA Revolution by Andrew H. Paterson 1996 (Chapter 2) in: Genome 
Mapping in Plants (ed. Andrew H. Paterson) by Academic Press/R. G. Landis 
Company, Austin, Texas, pp.7-21. 

1 0 The particular method of genotyping in the present invention may employ 

any number of molecular marker analytic techniques such as, but not limited to, 
restriction fragment length polymorphisms (RFLPs). RFLPs are the product of 
allelic differences between DNA restriction fragments caused by nucleotide 
sequence variability. Thus, the present invention further provides a means to 

15 follow segregation of a gene or nucleic acid of the present invention as well as 
chromosomal sequences genetically linked to these genes or nucleic acids using 
such techniques as RFLP analysis. 

Plants which can be used in the method of the invention include 
monocotyledonous and dicotyledonous plants. Preferred plants include maize, 

20 wheat, rice, barley, oats, sorghum, millet, rye, soybean, sunflower, alfalfa, canola 
and cotton. 

Seeds derived from plants regenerated from transformed plant cells, plant 
parts or plant tissues, or progeny derived from the regenerated transformed 
plants, may be used directly as feed or food, or further processing may occur. 

25 All publications cited in this application are herein incorporated by 

reference to the same extent as if each individual publication or patent application 
was specifically and individually indicated to be incorporated by reference. 

The present invention will be further described by reference to the 
following detailed examples. It is understood, however, that there are many 

30 extensions, variations, and modifications on the basic theme of the present 
invention beyond that shown in the examples and description, which are within 
the spirit and scope of the present invention. 
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Example 1 

Library construction used for the maize LEC1 EST's 

5 

A. Total RNA Isolation 

Total RNA was isolated from maize embryo and regenerating callus tissues 
with TRIzol Reagent (Life Technology Inc. Gaithersburg, MD) using a modification 
of the guanidine isothiocyanate/acid-phenol procedure described by Chomczynski 

10 and Sacchi (Chomczynski, P., and Sacchi, N. Anal. Biochem. 162, 156 (1987)). 
In brief, plant tissue samples were pulverized in liquid nitrogen before the addition 
of the TRIzol Reagent, and then were further homogenized with a mortar and 
pestle. Addition of chloroform followed by centrifugation was conducted for 
separation of an aqueous phase and an organic phase. The total RNA was 

15 recovered by precipitation with isopropyl alcohol from the aqueous phase. 

B. Poly(A)+ RNA Isolation 

The selection of poly(A)+ RNA from total RNA was performed using 
PolyATact system (Promega Corporation. Madison, Wl). In brief, biotinylated 
20 oligo(dT) primers were used to hybridize to the 3' poly(A) tails on mRNA. The 
hybrids were captured using streptavidin coupled to paramagnetic particles and a 
magnetic separation stand. The mRNA was washed at high stringent condition 
and eluted by RNase-free deionized water. 

25 C. cDNA Library Construction 

cDNA synthesis was performed and unidirectional cDNA libraries were 
constructed using the Superscript Plasmid System (Life Technology Inc. 
Gaithersburg, MD). The first stand of cDNA was synthesized by priming an 
oligo(dT) primer containing a Not I site. The reaction was catalyzed by 
30 Superscript Reverse Transcriptase II at 45°C. The second strand of cDNA was 
labeled with alpha- 32 P-dCTP and a portion of the reaction was analyzed by 
agarose gel electrophoresis to determine cDNA sizes. cDNA molecules smaller 
than 500 base pairs and unligated adapters were removed by Sephacryl-S400 
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chromatography. The selected cDNA molecules were ligated into pSPORTI 
vector in between of Not I and Sal I sites. 



Example 2 

5 Sequencing and cDNA subtraction procedures used for maize LEC1 EST's 

A. Sequencing Template Preparation 

Individual colonies were picked and DNA was prepared either by PCR with 
M1 3 forward primers and M13 reverse primers, or by plasmid isolation. All the 
10 cDNA clones were sequenced using M13 reverse primers. 



B. Q-bot Subtraction Procedure 

cDNA libraries subjected to the subtraction procedure were plated out on 
22 x 22 cm 2 agar plate at density of about 3,000 colonies per plate. The plates 

15 were incubated in a 37°C incubator for 12-24 hours. Colonies were picked into 
384-well plates by a robot colony picker, Q-bot (GENETIX Limited). These plates 
were incubated overnight at 37°C. 

Once sufficient colonies were picked, they were pinned onto 22 x 22 cm 
nylon membranes using Q-bot. Each membrane contained 9,216 colonies or 

20 36,864 colonies. These membranes were placed onto agar plate with appropriate 
antibiotic. The plates were incubated at 37°C for overnight. 

After colonies were recovered on the second day, these filters were placed 
on filter paper prewetted with denaturing solution for four minutes, then were 
incubated on top of a boiling water bath for additional four minutes. The filters 

25 were then placed on filter paper prewetted with neutralizing solution for four 
minutes. After excess solution was removed by placing the filters on dry filter 
papers for one minute, the colony side of the filters were place into Proteinase K 
solution, incubated at 37°C for 40-50 minutes. The filters were placed on dry filter 
papers to dry overnight. DNA was then cross-linked to nylon membrane by UV 

30 light treatment. 

Colony hybridization was conducted as described by Sambrook, J., Fritsch, 
E.F. and Maniatis, T., (in Molecular Cloning: A laboratory Manual, 2 nd Edition). 
The following probes were used in colony hybridization: 
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1. First strand cDNA from the same tissue from which the library was made to 
remove the most redundant clones. 

2. 48-1 92 most redundant cDNA clones from the same library based on previous 
sequencing data. 

3. 1 92 most redundant cDNA clones in the entire corn sequence database. 

4. A Sa!-A20 oligo nucleotide; TCG ACC CAC GCG TCC GAA AAA AAA AAA 
AAA AAA AAA, removes clones containing a poly A tail but no cDNA. 

5. cDNA clones derived from rRNA 

The image of the autoradiography was scanned into computer and the signal 
intensity and cold colony addresses of each colony was analyzed. Re-arraying of 
cold-colonies from 384 well plates to 96 well plates was conducted using Q-bot. 

Example 3 

Identification of Maize LEC1 EST's from a Computer Homology Search 

Gene identities were determined by conducting BLAST (Basic Local 
Alignment Search Tool; Altschul, S. F., et ai, (1993) J. Mol. Biol. 215:403-410; 
see also www.ncbi.nlm.nih.gov/BLAST/) searches under default parameters for 
similarity to sequences contained in the BLAST u nr" database (comprising all non- 
redundant GenBank CDS translations, sequences derived from the 3-dimensional 
structure Brookhaven Protein Data Bank, the last major release of the SWISS- 
PROT protein sequence database, EMBL, and DDBJ databases). The cDNA 
sequences were analyzed for similarity to all publicly available DNA sequences 
contained in the "nr" database using the BLASTN algorithm. The DNA sequences 
were translated in all reading frames and compared for similarity to all publicly 
available protein sequences contained in the u nr" database using the BLASTX 
algorithm (Gish, W. and States, D. J. (1993) Nature Genetics 3:266-272) provided 
by the NCBI. In some cases, the sequencing data from two or more clones 
containing overlapping segments of DNA were used to construct contiguous DNA 
sequences. 
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EXAMPLE 4 

Composition of cDNA Libraries Used to Isolate and Sequence 
Additional cDNA Clones 

cDNA libraries representing mRNAs from various corn, poppy, soybean 
and Vernonia tissues were prepared (see Table 1). The characteristics of the 
libraries are described below. 

TABLE 1 

cDNA Libraries from Corn, Poppy, Soybean and Vernonia 



Tissue 

Corn endosperm 20 days After pollination* 

Prickly poppy developing seeds 

Soybean embryo, 6 to 10 days after flowering 

Soybean embryo, 13 days after flowering 

Soybean embryogenic suspension 2 weeks after 

subculture 

Soybean mature embryo 8 weeks after subculture 

Soybean embryogenic suspension 

Vernonia developing seed* 

These libraries were normalized essentially as described in U.S. Pat. No. 
1 5 5,482,845, incorporated herein by reference. 

cDNA libraries were prepared in Uni-ZAP™ XR vectors according to the 
manufacturer's protocol (Stratagene Cloning Systems, La Jolla, CA). Conversion 
of the Uni-ZAP™ XR libraries into plasmid libraries was accomplished according 

20 to the protocol provided by Stratagene. Upon conversion, cDNA inserts were 
contained in the plasmid vector pBluescript. cDNA inserts from randomly picked 
bacterial colonies containing recombinant pBluescript plasmids were amplified vi£ 
polymerase chain reaction using primers specific for vector sequences flanking 
the inserted cDNA sequences or plasmid DNA was prepared from cultured 

25 bacterial cells. Amplified insert DNAs or plasmid DNAs were sequenced in dye- 
primer sequencing reactions to generate partial cDNA sequences (expressed 
sequence tags or "ESTs"; see Adams, M. D. era/., (1991) Science 252:1651). 
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The resulting ESTs were analyzed using a Perkin Elmer Model 377 fluorescent 
sequencer. 

EXAMPLE 5 

5 Identification of cDNA Clones obtained from tissue described in Table 1 

ESTs encoding plant transcription factors were identified by conducting 
BLAST (Basic Local Alignment Search Tool; Altschul, S. F., etal., (1993) J. Mol. 
Biol. 275:403-410; see also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity 

1 0 to sequences contained in the BLAST "nr* database (comprising all non- 
redundant GenBank CDS translations, sequences derived from the 3-dimensional 
structure Brookhaven Protein Data Bank, the last major release of the SWISS- 
PROT protein sequence database, EMBL, and DDBJ databases). The cDNA 
sequences obtained in Example 1 were analyzed for similarity to all publicly 

15 available DNA sequences contained in the "nr" database using the BLASTN 
algorithm provided by the National Center for Biotechnology Information (NCBI). 
The DNA sequences were translated in all reading frames and compared for 
similarity to all publicly available protein sequences contained in the "nr" 
database using the BLASTX algorithm (Gish, W. and States. D. J. (1993) Nature 

20 Genetics 3:266-272 and Altschul, Stephen F., et al. (1997) Nucleic Acids Res. 
25:3389-3402) provided by the NCBI. 

Example 6 

Identification of protein motifs diagnostic for LEC1 genes. 

25 

To determine the structural requirements for a LEC1 gene, HAP3 
homologs were identified in our EST database and aligned. By analyzing 
sequence homology amongst the plant HAP3 family of transcriptional activators 
these sequences were observed to fall into a least two distinctive groups. All of 
30 the HAP3 sequences derived from seed or embryo specific libraries form a 

distinctive LEC1 group that suggests a common evolutionary origin (confirmed by 
phylodentrograms). For example within the U B domain" of all plant LEC1 
sequences examined, a highly conserved CCAAT-box binding motif has been 
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found to contain the non-variable residues methionine, proline, isoleucine, 
alanine, asparagine, valine, and isoleucine (MPIANVI). LEC1 genes are highly 
divergent outside of the region spanning the DNA binding and subunit interaction 
motifs. The low levels of homology between these genes make it difficult to 
5 identify these based solely on a hybridization strategy. Using sequences from 
maize, soybean, wheat, prickly poppy, Vernonia, and Arabidopsis a motif 
diagnostic for LEC1 genes was identified, the specific amino acid substitutions for 
these species was clarified (Figure 1 ) and the positions at which amino acid 
substitutions occur within the LEC1 group was determined (SEQ ID NO: 23). 
10 Using Blast, the motif in SEQ ID NO: 23 was used to correctly distinguish LECI's 
from other closely related plant HAP3 transcriptional activators. 

Example 7 

Transformation and Regeneration of Maize Callus 

15 

Immature maize embryos from greenhouse or field grown High type II 
donor plants were bombarded with a plasmid containing a polynucleotide of the 
invention (LEC1). The LEC1 polynucleotide was operably linked to a constitutive 
promoter such as nos, or an inducible promoter, such as In2, plus a plasmid 

20 containing the selectable marker gene PAT (Wohlleben et al. (1988) Gene 70:25- 
37) that confers resistance to the herbicide Bialaphos fused to the Green 
Fluorescence protein. Transformation was performed as follows. 

The ears were surface sterilized in 50% Chlorox bleach plus 0.5% Micro 
detergent for 20 minutes, and rinsed two times with sterile water. The immature 

25 embryos were excised and placed embryo axis side down (scutellum side up), 25 
embryos per plate. These were cultured on 560 L medium 4 days prior to 
bombardment in the dark. Medium 560 L is an N6-based medium containing 
Eriksson's vitamins, thiamine, sucrose, 2,4-D, and silver nitrate. The day of 
bombardment, the embryos were transferred to 560 Y medium for 4 hours and 

30 were arranged within the 2.5-cm target zone. Medium 560Y is a high osmoticum 
medium (560L with high sucrose concentration). 

A plasmid vector comprising a polynucleotide of the invention operably 
linked to the selected promoter was constructed. This plasmid DNA plus plasmid 
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DNA containing a PAT selectable marker was precipitated onto 1.1 (average 
diameter) tungsten pellets using a CaCI 2 precipitation procedure as follows: 100 
^l prepared tungsten particles (0.6 mg) in water, 20 ^l (2 |ig) DNA in TrisEDTA 
buffer (1 ^g total), 100 nl 2.5 M CaC1 2 , 40 nl 0.1 M spermidine. 
5 Each reagent was added sequentially to the tungsten particle suspension. 

The final mixture was sonicated briefly. After the precipitation period, the tubes 
were centrifuged briefly, liquid removed, washed with 500 ml 100% ethanol, and 
centrifuged again for 30 seconds. Again the liquid was removed, and 60 nl 100% 
ethanol was added to the final tungsten particle pellet. For particle gun 
10 bombardment, the tungsten/DNA particles were briefly sonicated and 5 jil spotted 
onto the center of each macrocarrier and allowed to dry about 2 minutes before 
bombardment. 

The sample plates were bombarded at a distance of 8 cm from the 
stopping screen to the tissue, using a Dupont biolistics helium particle gun. All 

1 5 samples received a single shot at 650 PSI, with a total of ten aliquots taken from 
each tube of prepared particles/DNA. 

Four to 12 hours post bombardment, the embryos were moved to 560P (a 
low osmoticum callus initiation medium similar to 560L but with lower silver 
nitrate), for 3-7 days, then transferred to 560R selection medium, an N6 based 

20 medium similar to 560P containing 3 mg/liter Bialaphos, and subcultured every 2 
weeks. Multicellular GFP cell clusters became visible after two weeks and their 
numbers were periodically recorded. After approximately 10 weeks of selection, 
selection-resistant GFP positive callus clones were sampled for PCR and activity 
of the polynucleotide of interest. Positive lines were transferred to 288J medium, 

25 an MS-based medium with lower sucrose and hormone levels, to initiate plant 
regeneration. Following somatic embryo maturation (2-4 weeks), well-developed 
somatic embryos were transferred to medium for germination and transferred to 
the lighted culture room. Approximately 7-10 days later, developing plantlets 
were transferred to medium in tubes for 7-10 days until plantlets were well 

30 established. Plants were then transferred to inserts in flats (equivalent to 2.5" 
pot) containing potting soil and grown for 1 week in a growth chamber, 
subsequently grown an additional 1-2 weeks in the greenhouse, then transferred 
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to Classic™ 600 pots (1.6 gallon) and grown to maturity. Plants are monitored for 
expression of the polynucleotide of interest. 

Example 8 

5 Ectopic Expression of Maize LEC1 to Induce Somatic Embryogenesis 

Using the genotype High type II as an example, embryos were isolated and 
cultured on 560L medium for 3-5 days. Four to twelve hours before bombardment 
these embryos were transferred to high osmotic 560Y medium. Expression 

10 cassettes containing the LEC1 cDNA were then co-introduced into the scutella of 
these embryos along with an expression cassette containing the Pat gene fused 
to the Green Fluorescent protein using methods described in Example 7. 
Embryos from a single ear were divided evenly between treatments. Four to 12 
hours following bombardment embryos were then transferred back to a low 

15 osmoticum callus initiation medium (560P) and incubated in the dark at 26°C. 
After 3-7 days of culture these embryos were moved to 560R selection medium. 
Cultures were then transferred every two weeks until transformed colonies 
appear. Cultures were also examined microscopically for GFP expression. LEC1 
expression was expected to stimulate adventive embryo formation. This was 

20 apparent when the cultures were compared to controls (transformed without the 
LEC1 cDNA or non-induced). 

A. Ectopic expression of the maize LEC1 polynucleotide in tobacco is 
sufficient to induce somatic embryogenesis in tobacco leaves 

25 A maize LEC1 polynucleotide was placed into an agrobacterium 

expression cassette driven by the maize safener-induced In2 promoter (this 
promoter is leaky and expresses at low levels without induction). Also between 
the left and right T-DNA borders was the bar gene driven by 35S promoter and 
the Green Fluorescence Protein driven by the ubiquitin promoter. A similar 

30 construct was made without the LEC1 polynucleotide to be used as a control. 
Tobacco leaf discs from variety SR1 were co-cultured with Agrobacterium as 
described by Horsch etal. (1985, Science 227:1229-1231) except selecting with 
bialaphos rather than kanamycin. Transformants were selected on medium 



WO 00/28058 PCT/US99/26514 

-45- 

containing 3mg/l bialaphos. Of the numerous transformed shoot obtained, ectopic 
embryos were visible on the leaves of a single LEC1 transformant. None were 
visible on control plants. Although the frequency was low, ectopic somatic 
embryo formation was also reported to be a rare event in Arabidopsis LEC1 
5 transformants (Lotan et al. 1998). 

B. Transformation frequency was improved by LEC1 introduced using 
particle-mediated DNA delivery. 

A series of expression cassettes were made to evaluate the effects of 

10 LEC1 expression on maize transformation. The maize LEC1 polynucleotide was 
placed under the control of the In2 promoter (weakly induced with the auxin levels 
used under normal culture conditions and strongly-induced with safener), the 
barley NUC1 promoter (expressed strongly in the nucelius), the Ubiquitin 
promoter (strongly expressed constitutively), and the nos promoter (weakly 

1 5 expressed constitutively). A frame-shift version of the In2: LEC 1 cassette was 
made along with an ln2:ZM-NF-YB (designated as ln2:HAP3 henceforth) 
construct (The maize ZM NF-YB is non-LEC1 type of HAP3 transcriptional 
activator (Li et al Nucleic Acids Res. 20:1087-1091) for use as negative controls. 
All of these constructs were co-bombarded with the Pat~GFP fusion construct 

20 (designated as PAT-GFP) into high type II embryos as described in Example 7. 
Also, as in Example 7, immature embryos were harvested from separate ears, and 
the embryos from each ear were divided equally between treatments to account 
for ear-to-ear variability (for example, in an experiment comparing a control 
plasmid with this same plasmid + LEC1, one-half the total embryos from each ear 

25 would be used for each treatment. In some cases the control treatment contained 
the Pat~GFP construct co-bombarded with GUS. Transformation frequency was 
determined by counting the numbers of embryos with large multicellular GFP- 
positive cells clusters using a GFP microscope, and representing these as a 
percentage of the original number of embryos bombarded. No distinction was 

30 made between embryos with single or multiple events. In all cases, the functional 
LEC1 expression cassettes increased transformation frequencies over the control 
treatment (the LEC1 expression cassette also increased the incidence of multiple, 
i.e. 2-3, multicellular transgenic clones growing from the same immature embryo, 
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but as stated above we only scored these as a single event, and are providing a 
conservative representation of LEC1 's ability to improve transformation). For 
example, transformation frequencies in control treatments for three consecutive 
experiments were 5.1 , 7.4 and 0.8%. In balanced side-by-side comparisons for 
5 the same three experiments, transformation frequencies with the LEC1 

polynucleotide (ln2::LEC1::pinll) were 28.8, 25.7 and 12.4%, respectively. In 
addition to increasing the absolute number of transformants recovered from a 
given amount of target tissue, LEC1 transformants appeared earlier than the 
control transformants (suggesting that the LEC1 polynucleotide also stimulated 

10 growth rates). 

As a more stringent control, an expression vector was constructed in which 
a LEC1 gene, frame-shifted immediately after the start codon, was placed behind 
the In2 promoter In this experiment using embryos from 3 separate ears, the 
transformation frequencies in the control (frame-shifted LEC1) treatments were 

15 2.7, 6.0 and 2.0%, while the transformation frequencies for the LEC1 treatments 
were 62.7, 26 and 42.7%. This demonstrated clearly that expression of the in- 
frame LEC1 polynucleotide was associated with dramatic increases in 
transformation efficiency. 

Increasing the promoter strength (driving LEC1 expression) increased 

20 transformation frequencies. For example, an experiment was performed to 
compare the In2, nos and UBI promoters. Based on our experience with these 
two promoters driving other genes, the In2 promoter (in the absence of an inducer 
other than auxin from the medium) would drive expression at very low levels. The 
nos promoter has been shown to drive moderately-low levels of transgene 

25 expression (approximately 10- to 30-fold lower than the maize ubiquitin promoter, 
but still stronger than In2 under the culture conditions used in this experiment). 
Two control treatments were used in this experiment; the frame-shifted LEC1 
driven by the In2 promoter, or a maize ln2:HAP3 polynucleotide (a "non-LEC1 
type" representative of the transcriptional factor family to which LEC1 belongs). 

30 Both control treatments resulted in low transformation frequencies. After 3 weeks, 
the transformation frequency for the ln2:frame-shift-LEC1 (FS) treatment was 
4.8%, while for the ln2:HAP3 treatment it was 2%. The ln2:LEC1 , nos:LEC1 and 
UBI:LEC1 treatments resulted in 14%, 28% and 30% transformation frequencies, 
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respectively. Within these treatments there was also an increase in the overall 
frequency of large, rapidly growing calli. For the control treatments, the 
frequencies of large, vigorous GFP+ calli (relative to the starting number of 
embryos) was low (1 .6 and 0% for the frame-shift or ln2:HAP3, respectively). For 
5 the In2, nos and UBI treatments the overall frequencies of large, vigorous calli 
was 4, 13.3 and 20%, respectively. This is consistent with the interpretation that 
increased LEC1 expression resulted in more rapid in vitro growth of transgenic 
tissue. As is typical for transformation experiments scored in this fashion, 
between 3-6 weeks the number of recovered transformants continues to rise. 
1 0 After 5 weeks (post-bombardment), the frequency of healthy, growing 

transformants was 4.8 and 7.3% for the FS and HAP3 controls, while for the In2, 
nos and UBI-driven LEC1 treatments the frequencies were 22, 29.3 and 35.3%. 

C. Transformation frequency was improved by LEC1 introduced using 
15 agrobacterium. 

The Agrobacterium strains containing the superbinary plasmids described 
in Example 8A were used to transformed High type II embryos. Briefly, colonies 
containing the engineered Agrobacterium were grown to log phase in minimal A 
medium. Log phase cells were collected by centrifugation and resuspended in 

20 561 Q medium (N6 salts, Eriksson's vitamins, 1 .5mg/l 2,4-D , 68.5g/l sucrose, 
36g/l glucose, plus 20mg/l acetosyringone). Immature embryos, 1.5-2mm in 
length, were excised and immersed in this solution at a concentration of 5 X 10 
bacterial cells/ml. Embryos were vortexed in this medium and allowed to sit for 5 
minutes. The embryos were then removed and placed on 562P medium (560P 

25 medium with 1 0OmM acetosyringone and incubated at 20°C for 3 days. Embryos 
were moved again to 563N medium (an agar solidified medium similar to 560P 
with 100mg/l carbenicillin, 0.5g/l MES and reduced 2,4-D) and cultured at 28°C for 
3 days. Embryos were then moved to 5630 medium (563N medium with 3mg/l 
bialaphos) and transferred thereafter every 14 days to fresh 5630 medium. 

30 Bialaphos resistant GFP+ colonies were counted using a GFP microscope 

and transformation frequencies were determined as described in example 8B. 
Similar to particle gun experiments, transformation frequencies were greatly 
increased in the LEC1 treatment. For example, transformation frequencies for the 
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control treatment across embryos taken from 7 separate ears were 7.1 , 40.9, 
11.1, 7.4, 11.5, 12, 30.8, and 16.6%. The side-by-side comparison for the LEC 
treatment (in the same order of ears as above) showed that transformation 
frequencies were 13.5, 47, 55.8, 37.1, 40.6, 30, 57.1 and 40.8%. Averaged 
5 across all 7 ears, the average transformation frequency for the control was 1 6.6% 
while that of the LEC1 treatment was 40.8%. This represents a substantial 
increase for an already high baseline produced by Agrobacterium-mediated 
transformation. Comparing across ears, it was observed that the beneficial 
effects on transformation frequency were the greatest when the control 
1 0 frequencies were low. 

D. Transformants were recovered using LEC1 expression under reduced 
auxin levels or in the absence of auxins in the medium, and in the absence 
of herbicide or antibiotic selection. 

15 To determine if LEC1 could be used in a positive selection scheme, 

particle gun transformation experiments were initiated as described in Example 4 
and transformants were selected on medium with normal auxin levels, or on 
medium with reduced or no auxin, or visually (using GFP) on medium without 
bialaphos. Transformation frequencies were based on the numbers of embryos 

20 with one or more multicellular GFP positive cell clusters. In the first experiment to 
test this concept, there were two treatment variables. The first was that immature 
embryos were bombarded with the control plasmid (UBI:PAT~GFP) or with 
UBI:PAT~GFP+ln2:LEC1 . The second variable was that the bombarded embryos 
were divided onto either normal bialaphos-containing selection medium (with 

25 normal auxin levels of 2 mg/l 2,4-D), or medium with no bialaphos and reduced 
2,4-D levels (0.5 mg/l). As expected from previous results, on bialaphos selection 
the LEC1 treatment resulted in a higher transformation frequency than the control 
(5.7 versus 2.5%). It was also anticipated that the low auxin medium (0.5 mg/l 
2,4-D) would result in reduced growth rates. Consistent with this, for the control 

30 plasmid treatment (UBI:PAT~GFP), recovery of GFP-expressing (fluorescent) 
colonies was reduced relative to highly-effective bialaphos-selection treatment, 
dropping down to 0.6%. In contrast, it appeared that LEC1 expression, through 
its stimulation of embryogenesis, may have compensated for the low auxin 



WO 00/28058 PCT/US99/26514 



-49- 



environment, providing a growth advantage to the transgenic colonies, and 
maintaining the efficiency of transformant recovery at 4.0% (still in the same 
range as the LEC1/bialaphos-selected treatment). It's clear from this result that 
the inclusion of LEC1 improved colony growth on reduced auxin relative to the 
control. 

On medium completely devoid of auxin, colonies were only observed in the 
LEC1 treatment. In this experiment, immature embryos were bombarded with 
either the control plasmid (UBI:PAT~GFP) or with UBI:PAT~GFP+ln2:LEC1, and 
then plated either onto 3.0 mg/l bialaphos, 2.0 mg/l 2,4-D medium or onto no- 
bialaphos, no 2,4-D medium (in this latter treatment, wild-type maize callus will 
not exhibit embryogenic growth). Again, as expected, the LEC1 polynucleotide 
increased transformation to 22.7% over the control plasmid value of 8% on 
normal auxin-containing, bialaphos selection medium. Also, as expected, no 
transformants were recovered with the control plasmid on medium devoid of 
exogenous auxin. Surprisingly, in the LEC1 treated embryos transformants were 
recovered at a 4% frequency (this was still higher than the control plasmid on 

bialaphos selection). 

Even on auxin-containing medium, the LEC1 polynucleotide in combination 
with GFP+ expression can be used to recover transformants without chemical 
selection. For example, under these conditions the recovery of transformants was 
relatively efficient (16% compared to 18% for bialaphos selection), but this 
required more diligence than the low- or no-auxin treatments above to separate 
the GFP-expressing colonies from the growing callus population. 

E. LEC1 improves the embryogenic phenotype and regeneration capacity of 
inbreds. 

Immature embryos from the inbred PHP38 were isolated, cultured and 
transformed as described in example 4 with the following changes. Embryos were 
initially cultured on 601 H medium (a MS based medium with 0.1 mg/l zeatin, 2mg/l 
2,4-D, MS and SH vitamins, proline, silver nitrate, extra potassium nitrate, casein 
hydrolysate, gelrite, 10g/l glucose and 20g/l sucrose). Prior to bombardment 
embryos were moved to a high osmoticum medium (modified Duncan's with 2mg/l 
2,4-D and 12% sucrose). Post bombardment, embryos were moved to 601 H 
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medium with 3mg/l bialaphos for two weeks. Embryos were then moved to 601 H 
medium without proline and casein hydrolysate with 3mg/l bialaphos and 
transferred every two weeks. Transformation frequency was determined by 
counting the numbers of bialaphos resistant GFP-positive colonies. Colonies 
5 were also scored on whether they had an embryogenic (regenerable) or non- 
embryogenic phenotype. In PHP38, the LEC1 polynucleotide increased 
transformation frequency and improved the regenerative potential of the callus. 
For example, a balanced experiment (the embryos from each harvested ear were 
divided equally between treatments) was conducted in which PHP38 immature 

10 embryos were bombarded with the control plasmid (UBI::PAT~GFP::pinil) in one 
treatment, with the UBI::PAT-GFP::pinll plasmid + ln2::LEC1, or with the 
UBI::PAT~GFP::pinll plasmid + nuc1::LEC1 (a maize nucellus-specific promoter 
driving LEC1 expression). The frequency of GFP+ calli growing on bialaphos- 
containing media (relative to the starting number of embryos) was determined 6 

15 weeks after bombardment. For the control treatment, the transformation 
frequency was 1.2%, while for the ln2:LEC1 and nud ::LEC1 treatments the 
transformation frequencies were 3.2 and 2.0% respectively. In addition, the 
presence of the LEC1 polynucleotide appeared to greatly improve the 
regeneration capacity of the recovered transformants. None of the control 

20 transformants (UBI::PAT-GFP::pinll alone) had an embryogenic, regenerable 
phenotype, while the transformants recovered from the ln2:LEC1 and nuc1::LEC1 
treatments all exhibited a more vigorous, embryogenic growth pattern. This has 
been born out in the ability to recover plants. Callus from the ln2:LEC1 and 
nuc1::LEC1 treatments has produced many healthy plants. 

25 

Example 9 

Transient Expression of the LEC1 polynucleotide Product 
to Induce Somatic Embryogenesis 

30 It may be desirable to "kick start" somatic embryogenesis by transiently 

expressing the LEC-1 polynucleotide product. This can be done by delivering 
LEC1 Sapped polyadenylated RNA, expression cassettes containing LEC-1 
DNA, or LEC-1 protein. All of these molecules can be delivered using a biolistics 
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particle gun. For example 5'capped polyadenylated LEC1 RNA can easily be 
made in vitro using Ambion's mMessage mMachine kit. Following the procedure 
outline above RNA is co-delivered along with DNA containing an agronomically 
useful expression cassette. The cells receiving the RNA will immediately form 
5 somatic embryos and a large portion of these will have integrated the agronomic 
gene. Plants regenerated from these embryos can then be screened for the 
presence of the agronomic gene. 

Example 10 

1 0 Use of the maize LEC1 to induce apomixis 

Maize expression cassettes directing LEC1 expression to the inner 
integument or nucellus can easily be constructed. An expression cassette 
directing expression of the LEC1 polynucleotide to the nucellus was made using 

15 the barley Nuc1 promoter. Embryos were co-bombarded with the selectable 
marker PAT fused to the GFP gene along with the nucellus specific LEC1 
expression cassette described above. Both inbred (PHP38) and GS3 
transformants were obtained and regenerated as described in examples 4 and 5. 
Transformation frequencies were also increased over the control using the 

20 nuc1:LEC1 polynucleotide (see Example 8 above). 

It is anticipated that the regenerated plants will then be capable of 
producing de novo embryos from LEC1 expressing nucellar cells. This is 
complemented by pollinating the ears to promote normal central cell fertilization 
and endosperm development. In another variation of this scheme, nuc1:LEC1 

25 transformations could be done using a FIE-null genetic background which would 
promote both de novo embryo development and endosperm development without 
fertilization (see Ohad et a/. 1999 The Plant Cell 11:407-415; also pending US 
application Serial No. 60/151575 filed August 31 , 1999). Upon microscopic 
examination of the developing embryos it will be apparent that apomixis has 

30 occurred by the presence of embryos budding off the nucellus. In yet another 
variation of this scheme the LEC1 polynucleotide could be delivered as described 
above into a homozygous zygotic-embryo-lethal genotype. Only the adventive 
embryos produced from somatic nucellus tissue would develop in the seed. 
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EXAMPLE 11 
Expression of Chimeric Genes in Microbial Cells 

5 

The cDNAs encoding the instant transcription factors can be inserted into 
the T7 E. coli expression vector pBT430. This vector is a derivative of pET-3a 
(Rosenberg etal. (1987) Gene 56:125-135) which employs the bacteriophage T7 
RNA polymerase/T7 promoter system. Plasmid pBT430 was constructed by first 

10 destroying the EcoR I and Hind III sites in pET-3a at their original positions. An 
oligonucleotide adaptor containing EcoR I and Hind III sites was inserted at the 
BamH I site of pET-3a. This created pET-3aM with additional unique cloning sites 
for insertion of genes into the expression vector. Then, the Nde I site at the 
position of translation initiation was converted to an Nco I site using 

15 oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this 
region, 5'-CATATGG, was converted to ff-CCCATGG in pBT430. 

Plasmid DNA containing a cDNA may be appropriately digested to release a 
nucleic acid fragment encoding the protein. This fragment may then be purified 
on a 1% NuSieve GTG™ low melting agarose gel (FMC). Buffer and agarose 

20 contain 1 0 ng/ml ethidium bromide for visualization of the DNA fragment. The 
fragment can then be purified from the agarose gel by digestion with GELase™ 
(Epicentre Technologies) according to the manufacturer's instructions, ethanol 
precipitated, dried and resuspended in 20 \iL of water. Appropriate 
oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase 

25 (New England Biolabs, Beverly, MA). The fragment containing the ligated 

adapters can be purified from the excess adapters using low melting agarose as 
described above. The vector pBT430 is digested, dephosphorylated with alkaline 
phosphatase (NEB) and deproteinized with phenol/chloroform as described 
above. The prepared vector pBT430 and fragment can then be ligated at 16°C 

30 for 1 5 hours followed by transformation into DH5 electrocompetent cells (GIBCO 
BRL). Transformants can be selected on agar plates containing LB media and 
100 ng/mL ampicillin. Transformants containing the polynucleotide encoding the 
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transaiption factor are then screened for the correct orientation with respect to 
the T7 promoter by restriction enzyme analysis. 

For high level expression, a plasmid clone with the cDNA insert in the 
correct orientation relative to the T7 promoter can be transformed into E. coli 
5 strain BL21(DE3) (Studier etal. (1986) J. Mol. Biol. 789:113-130). Cultures are 
grown in LB medium containing ampicillin (100 mg/L) at 25°C. At an optical 
density at 600 nm of approximately 1, IPTG (isopropylthio-p-galactoside, the 
inducer) can be added to a final concentration of 0.4 mM and incubation can be 
continued for 3 hours at 25°C. Cells are then harvested by centrifugation and re- 

1 0 suspended in 50 fiL of 50 mM Tris-HCI at pH 8.0 containing 0. 1 mM DTT and 0.2 
mM phenyl methylsulfonyl fluoride. A small amount of 1 mm glass beads can be 
added and the mixture sonicated 3 times for about 5 seconds each time with a 
microprobe sonicator. The mixture is centrifuged and the protein concentration of 
the supernatant determined. One ug of protein from the soluble fraction of the 

1 5 culture can be separated by SDS-polyacrylamide gel electrophoresis. Gels can 
be observed for protein bands migrating at the expected molecular weight. 

EXAMPLE 12 

Evaluating Compounds for Their Ability to Inhibit the Activity 
20 of Plant Transcription Factors 

The transcription factors described herein may be produced using any 
number of methods known to those skilled in the art. Such methods include, but 
are not limited to, expression in bacteria as described in Example 6, or expression 

25 in eukaryotic cell culture, in plants, and using viral expression systems in suitably 
infected organisms or cell lines. The instant transcription factors may be 
expressed either as mature forms of the proteins as observed in vivo or as fusion 
proteins by covalent attachment to a variety of enzymes, proteins or affinity tags. 
Common fusion protein partners include glutathione S-transferase ("GST"), 

30 thioredoxin ("Trx"), maltose binding protein, and C- and/or N-terminal 

hexahistidine polypeptide ("(His) 6 "). The fusion proteins may be engineered with 
a protease recognition site at the fusion point so that fusion partners can be 
separated by protease digestion to yield intact mature enzyme. Examples of such 
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proteases include thrombin, enterokinase and factor Xa. However, any protease 
can be used which specifically cleaves the peptide connecting the fusion protein 
and the enzyme. 

Purification of the instant transcription factors, if desired, may utilize any 

5 number of separation technologies familiar to those skilled in the art of protein 
purification. Examples of such methods include, but are not limited to, 
homogenization, filtration, centrifugation, heat denaturation, ammonium sulfate 
precipitation, desalting, pH precipitation, ion exchange chromatography, 
hydrophobic interaction chromatography and affinity chromatography, wherein the 

1 0 affinity ligand represents a substrate, substrate analog or inhibitor. When the 
transcription factors are expressed as fusion proteins, the purification protocol 
may include the use of an affinity resin which is specific for the fusion protein tag 
attached to the expressed enzyme or an affinity resin containing ligands which 
are specific for the enzyme. For example, a transcription factor may be 

1 5 expressed as a fusion protein coupled to the C-terminus of thioredoxin. In 
addition, a (His) 6 peptide may be engineered into the N-terminus of the fused 
thioredoxin moiety to afford additional opportunities for affinity purification. Other 
suitable affinity resins could be synthesized by linking the appropriate ligands to 
any suitable resin such as Sepharose-4B. In an alternate embodiment, a 

20 thioredoxin fusion protein may be eluted using dithiothreitol; however, elution may 
be accomplished using other reagents which interact to displace the thioredoxin 
from the resin. These reagents include 0-mercaptoethanol or other reduced thiol. 
The eluted fusion protein may be subjected to further purification by traditional 
means as stated above, if desired. Proteolytic cleavage of the thioredoxin fusion 

25 protein and the enzyme may be accomplished after the fusion protein is purified 
or while the protein is still bound to the ThioBond™ affinity resin or other resin. 

Crude, partially purified or purified enzyme, either alone or as a fusion 
protein, may be utilized in assays for the evaluation of compounds for their ability 
to inhibit enzymatic activition of the transcription factors disclosed herein. Assays 

30 may be conducted under well-known experimental conditions that permit optimal 
enzymatic activity. 
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Example 13 

LEC1 expression resulted in increased growth rates, which could be used 
as a screening criterion for positive selection of transformants. 

5 Using two promoters of increasing strength to drive LEC1 expression in maize, 
it appeared that LEC1 stimulated callus growth over control treatments and the 
stronger promoter driving LEC1 resulted in faster growth than with the low-level 
promoter. For example, an experiment was performed to compare the In2 and 
nos promoters. As noted above, based on our experience with these two 

1 0 promoters driving other genes, the In2 promoter (in the absence of an inducer 
other than auxin from the medium) would drive expression at very low levels. The 
nos promoter has been shown to drive moderately-low levels of transgene 
expression (approximately 10- to 30-fold lower than the maize ubiquitin promoter, 
but still stronger than In2 under the culture conditions used in this experiment). 

1 5 One control treatment was used in this experiment, the UBI:PAT~GFPmo:pinll 
construct by itself (with no LEC1 ). Hi-ll immature embryos were bombarded as 
previously described, and transgenic, growing events were scored at 3 and 6 
weeks. The control treatment resulted in a transformation frequency of 0.8%. 
The ln2:LEC1 and nos:LEC1 treatments resulted in transformation frequencies of 

20 26.5 and 40.7%, respectively. 

Within these treatments there was also an increase in the overall frequency of 
large, rapidly growing calli, relative to the control treatment. For this data, the 
fresh weight of transformed calli were recorded 2 months after bombardment. 
Assuming that all the transgenic events started as single transformed cells within 

25 a few days after bombardment, these weights represent the relative growth rate of 
these transformants during this period (all tissue was sub-cultured and weighed 
for each transformant; mean weights and standard deviations were calculated for 
each treatment). For the control treatment, the mean transformant weight after 
two months was 37 +/- 15 mg (n=6). For the ln2:LEC1 and nos:LEC1 treatments, 

30 the mean transformant weights were 126 +/- 1 06 and 441 +/- 430 mg, 

respectively. If the control treatment was set at a relative growth value of 1 .0, this 
means that transformants in the ln2:LEC1 and nos:LEC1 treatments grew 3.4 and 
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12-fold faster than the control. From this data, it appears that increasing LEC1 
expression resulted in a concomitant increase in callus growth rate. 

Example 14 

5 The use of LEC1 polynucleotide as a positive selection system for wheat 
transformation and for improving the regeneration capacity of wheat tissues 



Method 
Plant Material 

10 Seeds of wheat Hybrinova lines NH535 and BO 014 were sown into soil in 

plug trays for vernalisation at 6°C for eight weeks. Vernalized seedlings were 
transferred in 8" pots and grown in a controlled environment room. The growth 
conditions used were; 1) soil composition: 75% L&P fine-grade peat, 12% 
screened sterilized loam, 10% 6 mm screened, lime-free grit, 3% medium grade 

15 vermiculite, 3.5 kg Osmocote per m 3 soil (slow-release fertiliser, 15-11-13 NPK 
plus micronutrients), 0.5 kg PG mix per m 3 (14-16-18 NPK granular fertiliser plus 
micronutrients, 2) 16 h photoperiod (400W sodium lamps providing irradiance of 
ca. 750 \xE s' 1 m" 2 ), 18 to 20°C day and 14 to 16°C night temperature, 50 to 70% 
relative air humidity and 3) pest control: sulphur spray every 4 to 6 weeks and 

20 biological control of thrips using Amblyseius caliginosus (Novartis BCM Ltd, UK). 



Isolation of Explants and Culture Initiation 

Two sources of primary explants were used; scutellar and inflorescence 
tissues. For scutella, early-medium milk stage grains containing immature 

25 translucent embryos were harvested and surface-sterilized in 70% ethanol for 5 
min and 0.5% hypochlorite solution for 15-30 min. For inflorescences, tillers 
containing 0.5-1.0 cm inflorescences were harvested by cutting below the 
inflorescence-bearing node (the second node of a tiller). The tillers were trimmed 
to approximately 8-10 cm length and surface-sterilized as above with the upper 

30 end sealed with Nescofilm (Bando Chemical Ind. Ltd, Japan). 

Under aseptic conditions, embryos of approximately 0.5-1 .0 mm length 
were isolated and the embryo axis removed. Inflorescences were dissected from 
the tillers and cut into approximately 1 mm pieces. Thirty scutella or 1 mm 
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inflorescence explants were placed in the center (18 mm target circle) of a 90 mm 
Petri dish containing MD0.5 or L7D2 culture medium. Embryos were placed with 
the embryo-axis side in contact with the medium exposing the scutellum to 
bombardment whereas inflorescence pieces were placed randomly. Cultures 
5 were incubated at 25±°C in darkness for approximately 24 h before bombardment. 
After bombardment, explants from each bombarded plate were spread across 
three plates for callus induction. 

Culture Media 

10 The standard callus induction medium for scutellar tissues (MD0.5) 

consisted of solidified (0.5% Agarge!, Sigma A3301) modified MS medium 
supplemented with 9% sucrose, 10 mg I* 1 AgN0 3 and 0.5 mg I' 1 2,4-D (Rasco- 
Gaunt et a/., 1999). Inflorescence tissues were cultured on L7D2 which consisted 
of solidified (0.5% Agargel) L3 medium supplemented with 9% maltose and 2 mg 

15 I" 1 2,4-D (Rasco-Gaunt and Barcelo, 1999). The basal shoot induction medium, 
RZ contained L salts, vitamins and inositol, 3% w/v maltose, 0.1 mg I" 1 2,4-D and 
5 mg P zeatin (Rasco-Gaunt and Barcelo, 1999). Regenerated plantlets were 
maintained in RO medium with the same composition as RZ, but without 2,4-D 
and zeatin. 

20 

DNA Precipitation Procedure and Particle Bombardment 

Submicron gold particles (0.6 pm Micron Gold, Bio-Rad) were coated with 
a plasmid containing the maize ln-2:LEC1 construct following the protocol 
modified from the original Bio-Rad procedure (Barcelo and Lazzeri, 1995). The 

25 standard precipitation mixture consisted of 1 mg of gold particles in 50 pi SDW, 
50 pi of 2.5 M calcium chloride, 20 pi of 100 mM spermidine free base and 5 pi 
DNA (concentration 1 pg pi" 1 ). After combining the components, the mixture was 
vortexed and the supernatant discarded. The particles were then washed with 
150 pi absolute ethanol and finally resuspended in 85 pi absolute ethanol. The 

30 DNA/gold ethanol solution was kept on ice to minimise ethanol evaporation. For 
each bombardment, 5 pi of DNA/gold ethanol solution (ca. 60 pg gold) was 
loaded onto the macrocarrier. 
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Particle bombardments were carried out using DuPont PDS 1000/He gun 
with a target distance of 5.5 cm from the stopping plate at 650 psi acceleration 
pressure and 28 in. Hg chamber vacuum pressure. 

5 Regeneration of Transformants 

For callus induction, bombarded explants were distributed over the surface 
of the medium in the original dish and two other dishes and cultured at 25+1 °C in 
darkness for three weeks. Development of somatic embryos from each callus 
were periodically recorded. For shoot induction, calluses were transferred to RZ 

1 0 medium and cultured under 1 2 h light (250 *iE s* 1 rrf 2 , from cool white fluorescent 
tubes) at 25+1 °C for three weeks for two rounds. All plants regenerating from the 
same callus were noted. Plants growing more vigorously than the control cultures 
were potted in soil after 6-9 weeks in R0 medium. The plantlets were 
acclimatized in a propagator for 1-2 weeks. Thereafter, the plants were grown to 

1 5 maturity under growth conditions described above. 

DNA Isolation from Callus and Leaf Tissues 

Genomic DNA was extracted from calluses or leaves using a modification 
of the CTAB (cetyltriethylammonium bromide, Sigma H5882) method described by 

20 Stacey and Isaac (1 994). Approximately 1 00-200 mg of frozen tissues was 

ground into powder in liquid nitrogen and homogenised in 1 ml of CTAB extraction 
buffer (2% CTAB, 0.02 M EDTA, 0.1 M Tris-CI pH 8, 1.4 M NaCI, 25 mM DTT) for 
30 min at 65°C. Homogenised samples were allowed to cool at room temperature 
for 15 min before a single protein extraction with approximately 1 ml 24:1 v/v 

25 chlorofoimoctanol was done. Samples were centrifuged for 7 min at 1 3,000 rpm 
and the upper layer of supernatant collected using wide-mouthed pipette tips. 
DNA was precipitated from the supernatant by incubation in 95% ethanol on ice 
for 1 h. DNA threads were spooled onto a glass hook, washed in 75% ethanol 
containing 0.2 M sodium acetate for 10 min, air-dried for 5 min and resuspended 

30 in TE buffer. Five pi RNAse A was added to the samples and incubated at 37°C 
for 1 h. 
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For quantification of genomic DNA, gel electrophoresis was performed 
using an 0.8% agarose gel in 1x TBE buffer. One microlitre of the samples were 
fractionated alongside 200, 400, 600 and 800 ng pi" 1 1 uncut DNA markers. 

5 Polymerase chain reaction (PCR) analysis 

The presence of the maize LEC1 polynucleotide was analyzed by PCR 
using 100-200 ng template DNA in a 30 ml PCR reaction mixture containing 1X 
concentration enzyme buffer (10 mM Tris-HCI pH 8.8, 1.5 mM magnesium 
chloride, 50 mM potassium chloride, 0.1% Triton X-100), 200 pM dNTPs, 0.3 pM 

1 0 primers and 0.022 U TaqDNA polymerase (Boehringer Mannheim). 

Thermocycling conditions were as follows (30 cycles): denaturation at 95°C for 30 
s, annealing at 55°C for 1 min and extension at 72°C for 1 min. Primer sequences 
(F=forward; Reverse) used were: (F) 5'-CGC TCT GTC ACC TGT TGT ACT C- 
3* (R) 5'-CGT GAT GAA GCT GAT GTA CTC C-3\ Approximate PCR product 

15 length was 620 bp. 

Results 

Following experiments to show increased regeneration capacity and 
improvement of maize transformation frequencies by expression of maize LEC1 , 
20 the polynucleotide was then introduced into wheat scutellar and inflorescence 
explants, driven by the maize In2 promoter. Both tissues are used for wheat 
transformation. 

Subsequent to the induction of somatic embryos from both tissues after 
three weeks on a 2,4-D-containing induction medium, calluses were assessed 
25 prior to transfer onto shoot regeneration medium. Callus assessment involved: a) 
scoring calluses as 0 = non-embryogenic callus, 1 = 25%, 2 = 25-50%, 3 = 50- 
75%, 4 = 75-100% of callus surface embryogenic, and b) determining 
embryogenic capacity expressed in percentage as the number of embryogenic 
calluses / total number of calluses (scutella or inflorescence) assessed. 

30 

Scutellum calluses 

Mean callus scores of control (1.4 ± 0.3) and LEC-bombarded (1.4 ± 0.3) 
scutellar tissues of wheat line NH535 were not significantly different. However, 
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calius score of LEC-bombarded scutella (1 .5 ± 0.5) of wheat line BO 014 was 
significantly improved in comparison with the control (0.5 ± 0.2). Similarly, 
embryogenic capacity of line NH535 did not seem to be affected by LEG 
treatment (LEC calluses = 84.3 ± 9.3%, control calluses = 90%). However, LEC- 
5 bombarded line BO 014 had clear increases in the embryogenic callus frequency 
(LEC calluses = 75.4 ± 16.8%, control calluses = 36.7 ± 4.7%). Examining the 
quality of embryogenic calluses formed, both lines showed significant increases in 
the number of 'good 1 calluses produced i.e. calluses with scores of 3 or 4. 'Good' 
quality callus of line NH535 increased from 5 to 22.3 % whilst line BO 014 
1 0 increased from 0 to 23.6%. These calluses were generally large, rapidly growing 
and vigorous. 

After callus induction and assessment, calluses were transferred onto 
shoot induction media for a total of six weeks. Shoot regeneration of calluses was 
determined, as the number of shoot regenerating calluses / total number of 

1 5 calluses assessed (expressed as percentages). Shoot regeneration of cultures 
corresponded with the quality and quantity of somatic embryos produced in each 
callus. Hence, regeneration of LEC-bombarded (71.9 ± 12.1) and control (70 ± 
14) callus tissues of line NH535 were not significantly different. However, 
regeneration of LEC-bombarded calluses (52.3 ± 26.9) of wheat line BO 014 was 

20 significantly improved in comparison with the control (1 5.6 ± 6.3). 

To test the suitability of LEC as a positive selection system for wheat, 
sample tissues from vigorous calluses were analyzed for the presence of LEC 
sequences. Forty-one BO 014 and 13 NH535 calluses were selected. The 
results were that 10/41 BO 014 calluses and 8/13 NH535 were PCR positive. 

25 Thus, transformed lines were identified without selection at frequencies of 24.4% 
and 61.5%. These frequencies are comparable with conventional selection 
systems such as herbicide- and antibiotic-resistance systems (e.g. bar, npt\\) 
applied in wheat transformation where selection 'escape' frequencies are 
commonly high and variable. Furthermore, we know of no other report of wheat 

30 transformation by morphological selection in the absence of a selection agent. 

Callus transformation frequencies were 5.6% and 4.4% in NH535 and BO 
014 lines, respectively. Transgenic plants were also recovered from LEC-positive 
callus lines. Seven non-clonal plants were recovered from NH535 and six non- 
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clonal plants were recovered from BO 014 to give plant transformation 
frequencies of 3.9 and 3.3%, respectively, based on the number of explants 
bombarded. 

5 Inflorescence calluses 

The use of inflorescence tissues as explants for the tissue culture and 
transformation of wheat offer several advantages over seed explants such as 
scutella (Rasco-Gaunt and Barcelo, 1999). However, responses of these tissues 
to culture are highly genotype-dependent and calluses are often non-regenerative 

10 despite having a 'highly-embryogenic' appearance. Hence, LEC was introduced 
into inflorescence tissues to see whether regeneration could be enhanced on a 
poorly regenerating line such BO 014. 

Using line BO 014, shoot regeneration was significantly improved in LEC- 
bombarded tissues, although callus quality appeared similar in bombarded and 

15 control tissues. Whereas no shoot was regenerated from control cultures, eight 
plants were regenerated from LEC-bombarded calluses to give a shoot 
regeneration frequency of 10.7%. 

Tables 

20 

Summary of tissue culture data 



Wheat 
line 


Treatment 


Mean 
callus 
score 
(0-4) 


Embryo. 
Capacity 

(%) 


% Good 
calluses 
(score 
3&4) 


Regeneration 
(%) 




NH 535 


Control 


1.4 ±0.3 


90.0 


5 


70 ±14 




LEC1 


1.4 ±0.3 


84.3 ± 9.3 


22.3 


71.9 ±12.1 


BO 014 


Control 


0.5 ± 0.2 


36.7 ±4.7 


0 


15.6 ±6.3 




LEC1 


1.5 ±0.5 


75.4 ± 
16.8 


23.6 


52.3 ± 26.9 
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Type of callus produced per treatment per line 



Callus ocore 


NH535 


BOI 


314 




Control 


LEC1 


Control 


LEC1 


0 


10 


15.7 


63.2 


25 


1 


45 


43.3 


26.3 


30 


2 


40 


28.7 


10.5 


21.4 


3 


5 


11.2 


0 


17.9 


4 


0 


1.1 


0 


5.7 



Transformation Frequency 



Wheat line 


Callus line 


Plant line (non-clonal) 


NH 535 


10/180 (5.6%) 


7/180 (3.9%) 


BO 014 


8/180 (4.4%) 


6/180 (3.3%) 



10 



EXAMPLE 15 
Expression of Chimeric Genes in Dlcot Cells 



The LEC1 polynucleotide can also be used to improve the transformation of 
1 5 soybean. To demonstrate this the construct consisting of the In2 promoter and 
LEC1 coding region were introduced into embryogenic suspension cultures of 
soybean by particle bombardment using essentially the methods described in 
Parrott, WA, LM. Hoffman, D.F. Hildebrand, E.G. Williams, and G.B. Collins, 
(1989) Recovery of primary transformants of soybean, Plant Cell Rep. 7:615-617. 
20 This method with modifications is described below. 

Seed was removed from pods when the cotyledons were between 3 and 5 
mm in length. The seeds were sterilized in a Chlorox solution (0.5%) for 15 
minutes after which time the seeds were rinsed with sterile distilled water. The 
immature cotyledons were excised by first cutting away the portion of the seed 
25 that contains the embryo axis. The cotyledons were then removed from the seed 
coat by gently pushing the distal end of the seed with the blunt end of the scalpel 
blade. The cotyledons were then placed (flat side up) SB1 initiation medium (MS 
salts, B5 vitamins, 20 mg/L 2,4-D, 31 .5 g/l sucrose, 8 g/L TC Agar, pH 5.8). The 
Petri plates were incubated in the light (16 hr day; 75-80 nE) at 26° C. After 4 
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weeks of incubation the cotyledons were transferred to fresh SB1 medium. After 
an additional two weeks, globular stage somatic embryos that exhibit proliferative 
areas were excised and transferred to FN Lite liquid medium (Samoylov, V.M., 
D.M. Tucker, and W.A. Parrott (1998) Soybean [Glycine max (L.) Merrill] 

5 embryogenic cultures: the role of sucrose and total nitrogen content on 

proliferation. In Vitro Cell Dev. Biol.- Plant 34:8-13). About 10 to 12 small clusters 
of somatic embryos were placed in 250 ml flasks containing 35 ml of SB172 
medium. The soybean embryogenic suspension cultures were maintained in 35 
mL liquid media on a rotary shaker, 150 rpm, at 26°C with florescent lights (20 nE) 

10 on a 16:8 hour day/night schedule. Cultures were sub-cultured every two weeks 
by inoculating approximately 35 mg of tissue into 35 mL of liquid medium. 

Soybean embryogenic suspension cultures were then be transformed using 
particle gun bombardment (Klein etal. (1987) Nature (London) 327:70, U.S. 
Patent No. 4,945,050). A BioRad Biolistic™ PDS1000/HE instrument was used for 

1 5 these transformations. A selectable marker gene which was used to facilitate 
soybean transformation is a chimeric gene composed of the 35S promoter from 
Cauliflower Mosaic Virus (Odell etal. (1985) Nature 373:810-812), the 
hygromycin phosphotransferase gene from plasmid pJR225 (from £. coli; Gritz et 
al. (1983) Gene 25:179-188) and the 3" region of the nopaline synthase gene from 

20 the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. 

To 50 \xL of a 60 mg/mL 1 ^m gold particle suspension was added (in order): 
5 \lL DNA (1 ng/^iL), 20 ^l spermidine (0.1 M), and 50 \iL CaCI 2 (2.5 M). The 
particle preparation was agitated for three minutes, spun in a microfuge for 10 
seconds and the supernatant removed. The DNA-coated particles were washed 

25 once in 400 ^L 70% ethanol and resuspended in 40 nL of anhydrous ethanol. 
The DNA/particle suspension was sonicated three times for one second each. 
Five pi of the DNA-coated gold particles were then loaded on each macro carrier 
disk. 

Approximately 300-400 mg of a two-week-old suspension culture was placed 
30 in an empty 60x1 5 mm petri dish and the residual liquid removed from the tissue 
with a pipette. Membrane rupture pressure was set at 1 100 psi and the chamber 
was evacuated to a vacuum of 28 inches mercury. The tissue was placed 
approximately 8 cm away from the retaining screen, and was bombarded three 
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times. Following bombardment, the tissue was divided in half and placed back 
into 35 ml of FN Lite medium. 

Five to seven days after bombardment, the liquid medium was exchanged 
with fresh medium. Eleven days post bombardment the medium was exchanged 
5 with fresh medium containing 50 mg/mL hygromycin. This selective medium was 
refreshed weekly. Seven to eight weeks post bombardment, green, transformed 
tissue was observed growing from untransformed, necrotic embryogenic clusters. 
Isolated green tissue was removed and inoculated into individual flasks to 
generate new, clonally propagated, transformed embryogenic suspension 

1 0 cultures. Each new line was treated as an independent transformation event. 
These suspensions were then subcultured and maintained as clusters of 
immature embryos, or tissue was regenerated into whole plants by maturation and 
germination of individual embryos. 

Two different genotypes were used in these experiments: 92B91 and 

1 5 93B82. Samples of tissue were either bombarded with the hygromycin resistance 
gene alone or with a 1 :1 mixture of the hygromycin resistance gene and the LEC1 
construct. Embryogenic cultures generated from 92B91 generally produce 
transformation events while cultures from 93B82 are much more difficult to 
transform. For transformation experiments with 92B91 , approximately equal 

20 numbers of transformants were recovered from bombardments conducted with the 
LEC1 polynucleotide as without it. Twenty-nine transformants were recovered 
from the LEC1 -treated 92B91 tissue while 27 transformants were recovered from 
tissue receiving only the hygromycin resistance gene. In contrast, transformants 
were only recovered from 93B82 tissue receiving the LEC1 polynucleotide (none 

25 were recovered from the treatment using only the hygromycin resistance gene). 
Five transformants were recovered from 93B82 tissue bombarded with the LEC1 
polynucleotide while no transformants were recovered from tissue treated with 
only the hygromycin resistance gene. These results show that the LEC1 
polynucleotide will be very valuable for gene transfer to recalcitrant genotypes of 

30 soybean. 
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WHAT IS CLAIMED IS 
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1 . An isolated nucleic acid comprising a member selected from the group 
consisting of: 

5 (a) a polynucleotide which encodes a polypeptide of SEQ ID NO: 

2, 8,10,12,14,16,18, 20, or 22; 

(b) a polynucleotide amplified from a plant nucleic acid library using 
the primers of SEQ ID NOS: 3 and 4, 5 and 6, 9 and 10, or 1 1 
and 12 or primers determined by using Vector nti Suite, InforMax 

1 0 Version 5. 

(c) a polynucleotide comprising at least 20 contiguous bases of 
SEQ ID NO: 1, 7, 9, 11, 13, 15, 17, 19, or 21; 

(d) a polynucleotide having a plant HAP3-type ccaat-box transcriptional 
activator, wherein the polynucleotide is from a plant other than 

15 Arabidopsis; 

(e) a polynucleotide having at least 60% sequence identity to SEQ 
ID NO: 1, 9, 11, 13, 17, or 21 or 65% sequence identity to SEQ 
ID NO: 15 or 19 or 70% sequence identity to SEQ ID NO: 7, 
wherein the % sequence identity is based on the entire 

20 sequence and is determined by GAP analysis using Gap Weight 

of 50 and Length Weight of 3; 

(f) a polynucleotide comprising at least 25 nucleotides in length 
which hybridizes under high stringency conditions to a 
polynucleotide having the sequence set forth in SEQ ID NO: 

25 1,7,9, 11, 13, 15, 17, 19, or21; 

(g) a polynucleotide containing a conserved region coding the 
polypeptide of SEQ ID NO: 23, wherein the polynucleotide is 
from a plant other than Arabidopsis] 

(h) a polynucleotide having the sequence set forth in SEQ ID NO: 1 , 
30 7,9,11,13,15,17, 19, or 21; and 

(i) a polynucleotide complementary to a polynucleotide of (a) 
through (h). 
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2. The isolated nucleic acid of claim 1 , wherein the transcriptional activator in 
(d) is from corn, soybean, wheat, rice, Veronia, or Argemone. 

3. The isolated nucleic acid of claim 1 wherein the polynucleotide in (f) is a 
5 functional equivalent. 



4. A vector comprising at least one nucleic acid of claim 1 . 

5. An expression cassette comprising at least one nucleic acid of claim 1 
10 operably linked to a promoter, wherein the nucleic acid is in sense or 

antisense orientation. 

6. The expression cassette of claim 5, wherein the nucleic acid is operably 
linked in antisense orientation to the promoter. 

15 

7. A host cell containing at least one expression cassette of claim 5. 



8. The host cell of claim 7 that is a plant cell. 



20 9. A transgenic plant comprising at least one expression cassette of claim 5. 

1 0. The transgenic plant of claim 9, wherein the plant is corn, soybean, 
sorghum, wheat, rice, alfalfa, sunflower, canola or cotton. 



25 11. A seed from the transgenic plant of claim 9. 
1 2. The seed from the transgenic plant of claim 1 0. 



1 3. An isolated protein comprising a member selected from the group 
30 consisting of: 

(a) a polypeptide comprising at least 25 contiguous amino acids of SEQ 
ID NO: 2, 8, 10, 12, 14, 16, 18, 20, or 22; 
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(b) a polypeptide which is a LEC1 transcriptional activator containing 
the conserved motifs of SEQ ID NO: 23, wherein the polypeptide is 
from a plant other than Arabidopsis, 

(c) a polypeptide comprising at least 60% sequence identity to SEQ ID 

5 NO: 2, 12, 14, 16, 20, or 22 or 70 % sequence identity to SEQ ID NO: 

8, 10, or 18, wherein the % sequence identity is based on the entire 
sequence and is determined by GAP analysis using Gap Weight of 12 
and Length Weight of 4; 

(d) a polypeptide encoded by a nucleic acid of claim 1 ; 

-, 0 (e) a polypeptide encoded by a nucleic acid of SEQ ID NO: 1 , 7, 9, 1 1 , 

13, 15, 17, 19, or 21; and 
(f) a polypeptide having the sequence set forth in SEQ ID NO: 2, 8, 10, 
12, 14, 16, 18, 20, or 22. 

15 14. The protein of claim 1 3, wherein the transcriptional activator of (b) is a 
LEC1 protein from corn, soybeans, wheat, rice, Veronia, or Argemone. 

15. A ribonucleic acid sequence encoding a protein of claim 1 3. 

20 16. A method for modulating LEC1 activity in a plant, comprising: 

(a) transforming a plant cell with at least one expression cassette of 
claim 5 and 

(b) regenerating a transformed plant that expresses the at least one 
nucleic acid in an amount sufficient to modulate LEC1 activity in the 

25 plant. 

17. The method of claim 16, wherein the plant is corn, soybean, sorghum, 
wheat, rice, alfalfa, sunflower, canola or cotton. 

30 18. The method of claim 1 6, wherein LEC1 activity is increased. 



1 9. The method of claim 1 6, wherein LEC1 activity is decreased. 
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20. A method for transiently modulating the level of LEC1 activity in plant cells 
comprising introducing at least one nucleic acid of claim 1 . 



21 . The method of claim 20 wherein the at least one nucleic acid is a 
5 ribonucleic acid. 



22. A method for transiently modulating the level of LEC1 activity in plant cells 
comprising introducing at least one polypeptide of claim 13. 

10 23. A method for enhancing tissue culture response in a plant cell comprising 
introducing into the plant cell at least one LEC1 polypeptide or at least one 
LEC1 polynucleotide under conditions sufficient to enhance tissue culture 
response. 

1 5 24. The method of claim 23 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

25. The method of claim 23 wherein at least one polynucleotide is operably 
linked to a promoter driving expression in the plant cell and growing the 
20 plant cell. 



26. The method of claim 23 wherein the plant cell is a recalcitrant cell. 



27. The method of claim 26 wherein the plant cell is an inbred plant cell. 

25 

28. A method for inducing somatic embryogenesis in a plant cell comprising 
introducing into a responsive plant cell at least one LEC1 polypeptide or at 
least one LEC1 polynucleotide, wherein the responsive plant cell is 
transformed and grown under conditions sufficient to stimulate the 

30 production of a transformed embryo and wherein the plant cell is other than 

an Arabidopsis cell. 
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29. The method of claim 28 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

30. The method of claim 29 wherein at least one polynucleotide is operably 
5 linked to a promoter driving expression in the plant cell. 

31 . The method of claim 28 further comprising growing the transformed embryo 
under plant growing conditions to produce a regenerated plant. 

10 32. The method of claim 28 wherein the plant cell is from corn, soybean, 
sorghum, wheat, rice, alfalfa, sunflower, canola or cotton. 

33. A plant produced by the method of claim 28. 

15 34. A method for positive selection of a transformed cell comprising introducing 
into a responsive plant cell at least one LEC1 polynucleotide or at least one 
LEC1 polypeptide and growing the transformed plant cell, wherein the 
responsive plant cell is transformed and grown under conditions sufficient 
to induce embryogenesis to provide a positive selection means. 

20 

35. The method of claim 34 wherein the plant cell is transformed with at least 
one polynucleotide. 

36. The method of claim 35 wherein at least one polynucleotide is operably 
25 linked to a promoter driving expression in a plant. 

37. The method of claim 34 further comprising altering media components to 
favor the growth of transformed plant cells. 

30 38. The method of claim 34 wherein the media components are altered to 
reduce somatic embryogenesis in non-transformed cells. 

39. The method of claim 36 wherein the polynucleotide is excised. 
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40. The method of claim 39 wherein the polynucleotide is flanked by FRT 
sequences to allow FLP mediated excision of the polynucleotide. 

41 . A method for inducing apomixis in a plant cell comprising introducing into a 
5 responsive plant cell at least one LEC1 polypeptide or at least one LEC1 

polynucleotide and growing the plant cell, wherein the introducing and 
growing is done under conditions sufficient to produce a transformed 
somatic embryo. 

10 42. The method of claim 41 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

43. The method of claim 44 wherein the at least one polynucleotide is operably 
linked to a promoter driving expression in a plant cell 

15 

44. The method of claim 41 further comprising suppressing in the plant cell the 
expression of an FIE polycomb polynucleotide using sense or antisense 
methods . 

20 45. The method of claim 41 further comprising growing the embryo under plant 
growing conditions to produce a regenerated plant. 

46. The method of claim 42 wherein the promoter is an inducible promoter. 

25 47. The method of claim 42 wherein the at least one polynucleotide is 
expressed in integument or nucelius tissue. 

48. A plant produced by the method of claim 41 . 

30 49. The plant of claim 48, wherein the plant is male sterile. 
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50. A method for increasing transformation efficiency comprising introducing at 
least one LEC1 polypeptide or at least one LEC1 polynucleotide into a 
responsive plant cell under conditions sufficient to increase transformation 
efficiency. 

5 

51 . The method of claim 50 wherein the transformation is conducted in medium 
that retards growth of somatic embryo growth in non-transformed plants. 

52. The method of claim 51 wherein transformation is conducted with reduced 
10 levels of auxin or no auxin. 

53. The method of claim 50 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

15 54. The method of claim 53 wherein the at least one polynucleotide is operably 
linked to a promoter driving expression in a plant cell 

55. The method of claim 50 wherein the plant cell is a recalcitrant cell. 

20 56. The method of claim 55 wherein the plant cell is an inbred cell. 

57. A method for increasing recovery of regenerated plants comprising 
introducing into a responsive plant cell at least one LEC1 polypeptide or at 
least one LEC1 polynucleotide and growing the plant cell, wherein the 

25 introducing and growing are under conditions sufficient to produce a 

regenerated plant. 

58. The method of claim 57 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 



30 



59. The method of claim 58 wherein the at least one polynucleotide is operably 
linked to a promoter driving expression in a plant cell 
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60. The method of claim 57 wherein the plant cell is a recalcitrant cell. 

61 . The method of claim 60 wherein the plant cell is an inbred plant cell. 
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SEQUENCE LISTING 

<110> Pioneer Hi-Bred International, Inc. 
E. I. du Pont de Nemours & Co. 
DuPont (U.K.) Limited 

<120> Transcriptional Activator Nucleic Acids, 
Polypeptides, and Methods of Use Thereof 



<130> 0943-PCT 

<150> 60/107,643 
<151> 1998-11-09 

<160> 23 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 1173 

<212> DNA 

<213> Zea mays 

<220> 

<221> CDS 

<222> (69) . . . (902) 

<400> 1 

ccacgcgtcc gccaccacac cacgagcgcg cgataaccct agctagcttc aggtagtagc 
gagagcca atg gac tec age age ttc etc cct gec gec ggc gcg gag aat 
Met Asp Ser Ser Ser Phe Leu Pro Ala Ala Gly Ala Glu Asn 
1 5 10 

ggc teg gcg gcg ggc ggc gee aac aat ggc ggc get get cag cag cat 
Gly Ser Ala Ala Gly Gly Ala Asn Asn Gly Gly Ala Ala Gin Gin His 
15 20 25 30 

gcg gcg ccg gcg ate cgc gag cag gac egg ctg atg ccg ate gcg aac 
Ala Ala Pro Ala He Arg Glu Gin Asp Arg Leu Met Pro He Ala Asn 
35 40 45 

gtg ate cgc ate atg egg cgc gtg ctg ccg gcg cac gee aag ate teg 
Val He Arg He Met Arg Arg Val Leu Pro Ala His Ala Lys He Ser 
50 55 60 

gac gac gec aag gag acg ate cag gag tgc gtg teg gag tac ate age 
Asp Asp Ala Lys Glu Thr lie Gin Glu Cys Val Ser Glu Tyr He Ser 
65 70 75 

ttc ate acg ggg gag gec aac gag egg tgc cag egg gag cag cgc aag 
Phe He Thr Gly Glu Ala Asn Glu Arg Cys Gin Arg Glu Gin Arg Lys 
80 85 90 

acc ate ace gee gag gac gtg ctg tgg gec atg age cgc etc ggc ttc 
Thr He Thr Ala Glu Asp Val Leu Trp Ala Met Ser Arg Leu Gly Phe 
95 100 105 HO 

gac gac tac gtc gag ccg etc ggc gec tac etc cac cgc tac cgc gag 
Asp Asp Tyr Val Glu Pro Leu Gly Ala Tyr Leu His Arg Tyr Arg Glu 



60 
110 



158 



206 



254 



302 



350 



398 



446 
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115 120 



130 x " 

2S5S5S = S = = 22 3 SS:! 
£ - s S E £ E = E K K = - S = S " 

160 165 

E E E E E 2 S 22 E E E S S S S i 

175 180 1 

E « S = S E S E S E E E E E 1 2 

195 200 

„. c.c r . oj, s jjt 2; ~ - s in »= e S E E 

Pro His Pro Gin Gly Ser ber nis y 22Q 
210 21:5 

S S: E E E E E 2 E E S S E E E E 

225 230 

S E E E E E 5 = E S E E E E E E 

240 245 

gjt , gc „, «. r s ~ - - s E E E S' E 

Gly Gly Gly Ser Ala Ser His inr rro ^± j 27Q 
255 260 
cac ccg =ac cog ttc gcg tac aag tagctagttc gtacgtcgtt cgacttgagc 932 
His Pro His Pro Phe Ala Tyr Lys 
275 

aagccatcga tctgctgatc tgaacgtacg ctjtjttjt. «cjc.tgc. cgt.cgt.tc J92 

ggcggctagc tctcctgttt aagttgtact Sjgattctgt cccgg gg ul2 
".iettcct tcagtctcta gtttctt.gc ^cgtagaa gtgttc , n72 
tgttgtttta gggccggggt aaaccatccg atgagattat tu 1173 



<210> 2 
<211> 278 
<212> PRT 
<213> Zea mays 



„. t «. - «• «•» '» f 0 - cly M * Glu A * n ?/ s " 

aL Ala oiy .iy «. «.» oiy «» «■ «■ sl ° G1 " f 0 ' M ' 

P„ Al. U. «, Glu Gin A.P Ar, L.u »t Pr. II. M. A»» V.l II. 



494 



542 



590 



638 



686 



734 



782 



830 



878 
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Arg He 
50 

Ala Lys 
65 

Thr Gly 

Thr Ala 

Tyr Val 

Gly Asp 
130 
Gly Gly 
145 

Ser Arg 

His His 

Pro Pro 

Pro Gin 
210 
Gly Gly 
225 

Pro Gly 
Gly Ser 
His Pro 



Met Arg Arg 

Glu Thr He 

Glu Ala Asn 
85 

Glu Asp Val 

100 
Glu Pro Leu 
115 

Ala Arg Gly 

Asp His His 

Gly Pro Val 
165 

Asp Met Gin 

180 
Pro Ala Gly 
195 

Gly Ser Ser 

Glu His Ala 

Asn Gly Gly 
245 

Ala Ser His 

260 
Phe Ala Tyr 
275 



Val Leu 
55 

Gin Glu 
70 

Glu Arg 

Leu Trp 

Gly Ala 

Val Gly 
135 
Pro His 
150 

Ser Gly 
Met His 
Pro Pro 



Pro Ala His 

Cys Val Ser 

Cys Gin Arg 
90 

Ala Met Ser 

105 
Tyr Leu His 
120 

Leu Val Pro 

Ser Met Ser 

Ala Ala Met 
170 

Ala Ala Met 
185 

His His Gly 
200 

Leu Pro Tyr 



His Tyr 
215 

Met Ala Ala Tyr Tyr 
230 

Ser Gly 



Thr Pro 
Lys 



Asp Gly Ser 
250 

Gin Gly Ser 
265 



Ala Lys 

60 
Glu Tyr 
75 

Glu Gin 

Arg Leu 

Arg Tyr 

Gly Ala 
140 
Pro Ala 
155 

Leu Pro 

Tyr Gly 

Gly Phe 

Ala Tyr 
220 
Gly Gly 
235 

Gly Ser 
Gly Gly 



He Ser Asp Asp 
He Ser 
Arg Lys 



Gly Phe 
110 
Arg Glu 
125 

Ala Pro 

Ala Met 

His His 

Gly Thr 
190 
Leu Met 
205 

Glu Pro 

Ala Ala 

Gly Gly 

Leu Glu 
270 



Phe He 
80 

Thr He 
95 
Asp Asp 



Phe Glu 

Ser Arg 

Leu Lys 
160 
His His 
175 

Ala Val 

Pro His 

Thr Tyr 

Tyr Ala 
240 
Gly Gly 
255 

His Pro 



<210> 3 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 3 20 
tagtagcgag agccaatgga 

<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 



<400> 4 20 
gccgggacag aatcacagta 

<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 
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<400> 5 
tagtagcgag agccaatgga 

<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 6 
cccggcccta aaacaacaca 



<210> 7 
<211> 481 
<212> DNA 

<213> Argemone mexicana 



<220> 

<221> CDS 

<222> (44) . . . (481) 

<221> misc_feature 
<222> (1) . . . (481) 
<223> n = A,T,C or G 



<400> 7 

cgagagaaag agttggtgaa gaagaagaag aagttgaaaa gag atg gaa cgt ggt 

Met Glu Arg Gly 
1 

ggt ggt ggt ggt ggt agt ggt ggt ggt ttc cat gga tat cag aaa etc 
Gly Gly Gly Gly Gly Ser Gly Gly Gly Phe His Gly Tyr Gin Lys Leu 
5 10 15 20 

cca aaa tea aac tec get gga atg atg etc teg gag eta teg aat aac 
Pro Lys Ser Asn Ser Ala Gly Met Met Leu Ser Glu Leu Ser Asn Asn 
25 30 35 

aac aac aat att gac gta aac tct aca tgt act gta cga gag caa gat 
Asn Asn Asn He Asp Val Asn Ser Thr Cys Thr Val Arg Glu Gin Asp 
40 45 50 

cga tac atg cca att get aat gtg ate agg ate atg cgt aag gta ctt 
Arg Tyr Met Pro He Ala Asn Val He Arg He Met Arg Lys Val Leu 
55 60 65 

cct act cat gee aag ate tct gac gat gee aaa gaa act ate caa gaa 
Pro Thr His Ala Lys He Ser Asp Asp Ala Lys Glu Thr He Gin Glu 
70 75 80 

tgt gtc tea gaa tac ate agt ttc ate aca agt gaa gee aat gat cgt 
Cys Val Ser Glu Tyr He Ser Phe He Thr Ser Glu Ala Asn Asp Arg 
85 90 95 100 

tgc caa cgt gaa caa aga aag aca ate aca get gaa gat gtt tta tgg 
Cys Gin Arg Glu Gin Arg Lys Thr He Thr Ala Glu Asp Val Leu Trp 
105 HO H5 



55 



103 



151 



199 



247 



295 



343 



391 
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gcg atg age aaa eta ggg ntt gat gag tac att gaa cot eta act ctt 
Ala Met Ser Lys Leu Gly Xaa Asp Glu Tyr lie Glu Pro Leu Thr Leu 
120 125 130 



439 



tac ctt caa cgt tat cgt gag ttt gaa ggt gna cgt tgg tea 
Tyr Leu Gin Arg Tyr Arg Glu Phe Glu Gly Xaa Arg Trp Ser 
135 140 145 



481 



<210> 8 
<211> 146 
<212> PRT 

<213> Argemone mexicana 
<220> 

<221> VARIANT 

<222> (1) • . . (146) 

<223> Xaa - Any Amino Acid 





<400> 


8 




















Met 


Glu 


Arg 


Gly Gly 


Gly Gly 


Gly Gly Ser 


Gly Gly 


Gly 


Phe 


His 


Gly 


1 




5 








10 








15 




Tyr 


Gin 


Lys 


Leu Pro 


Lys 


Ser 


Asn 


Ser Ala 


Gly Met 


Met 


Leu 


Ser 


Glu 






20 








25 






30 






Leu 


Ser 


Asn 


Asn Asn 


Asn 


Asn 


He Asp Val 


Asn Ser 


Thr 


Cys 


Thr 


Val 






35 








40 






45 








Arg 


Glu 


Gin 


Asp Arg 


Tyr 


Met 


Pro 


He Ala 


Asn Val 


He 


Arg 


He 


Met 


50 








55 






60 










Arg Lys 


Val 


Leu Pro 


Thr 


His 


Ala 


Lys He 


Ser Asp 


Asp 


Ala 


Lys 


Glu 


65 








70 








75 








80 


Thr 


He 


Gin 


Glu Cys 
85 


Val 


Ser 


Glu 


Tyr He 
90 


Ser Phe 


He 


Thr 


Ser 
95 


Glu 


Ala 


Asn 


Asp 


Arg Cys 


Gin 


Arg 


Glu 


Gin Arg 


Lys Thr 


He 


Thr 


Ala 


Glu 






100 








105 






110 






Asp 


Val 


Leu 


Trp Ala 


Met 


Ser 


Lys 


Leu Gly Xaa Asp 


Glu 


Ty r 


He 


Glu 




115 








120 






125 








Pro 


Leu 


Thr 


Leu Tyr 


Leu 


Gin 


Arg 


Tyr Arg 


Glu Phe 


Glu 


Gly 


Xaa 


Arg 



130 135 140 

Trp Ser 
145 



<210> 9 

<211> 942 

<212> DNA 

<213> Glycine max 

<220> 

<221> CDS 

<222> (3) . . . (722) 



<400> 9 

gc acg age tct ctt ata ate aca cac aca cct acc tta ata get atg 47 

Thr Ser Ser Leu He He Thr His Thr Pro Thr Leu He Ala Met 
15 10 15 



gaa act gga ggc ttt cac ggc tac cgc aag etc ccc aac acc acc get 
Glu Thr Gly Gly Phe His Gly Tyr Arg Lys Leu Pro Asn Thr Thr Ala 
20 25 30 



95 
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ggg ttg aag ctg tea gtg tea gac atg aac atg agg cag cag gta gca 
Gly Leu Lys Leu Ser Val Ser Asp Met Asn Met Arg Gin Gin Val Ala 
35 40 45 

tea tea gat cac agt gca gee aca gga gag gag aac gaa tgc acg gtg 
Ser Ser Asp His Ser Ala Ala Thr Gly Glu Glu Asn Glu Cys Thr Val 



50 



55 60 



agg gag caa gac agg ttc atg cea ate gee aac gtg att agg ate atg 
Arg Glu Gin Asp Arg Phe Met Pro lie Ala Asn Val lie Arg lie Met 



65 



70 75 



cgc aag att etc cct cca cac gca aaa ate teg gac gat gca aaa gaa 
Arg Lys lie Leu Pro Pro His Ala Lys He Ser Asp Asp Ala Lys Glu 
y 85 9° 95 



80 



aca ate caa gag tge gtg tet gag tac ate age ttc ate aca ggt gag 
Thr He Gin Glu Cys Val Ser Glu Tyr He Ser Phe lie Thr Gly Glu 



100 



105 HO 



geg aac gag cgt tgc eag agg gag cag egg aag acc ata ace gca gag 
Ala Asn Glu Arg Cys Gin Arg Glu Gin Arg Lys Thr lie Thr Ala Glu 
115 120 125 

gac gtg ctt tgg gee atg age aag ctt gga ttc gac gac tac ate gaa 
Asp Val Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr lie Glu 

135 140 



130 



ccg ttg acc atg tac ctt cac cgc tac cgt gaa ctt gag ggt gac cgc 
Pro Leu Thr Met Tyr Leu His Arg Tyr Arg Glu Leu Glu Gly Asp Arg 

150 I 55 



145 



acc tet atg agg ggt gaa cca etc ggg aag agg act gtg gaa tac gee 
Thr Ser Met Arg Gly Glu Pro Leu Gly Lys Arg Thr Val Glu Tyr Ala 

165 170 I 75 



160 



acg ctt ggt gtt get act get ttt gte cct cca ecc tat cat cac cac 
Thr Leu Gly Val Ala Thr Ala Phe Val Pro Pro Pro Tyr His His His 

185 i9 ° 



180 



aat aaa tac ttt ggt get gee atg ecc atg ggg act tac gtt agg gaa 
Asn 111 Tyr Phe Sly Ala Ala Met Pro Met Gly Thr Tyr Val Arg Glu 

200 205 



195 



geg cca cca aat aca gee tec tec cat cac cac cac cac cac cac cac 
Ala Pro Pro Asn Thr Ala Ser Ser His His His His His His His His 



210 



215 220 



143 



191 



239 



287 



335 



383 



431 



479 



527 



575 



623 



671 



719 



cac eat get cgt gga ate tec aat get cat gaa cea aat get cgc tec 
His His Ala Arg Gly He Ser Asn Ala His Glu Pro Asn Ala Arg Ser 
225 230 235 

ata taaaattata taattatgac taggattcag aacaagactt gatgatgatt 
He 
240 

agettaactc teagtaattg gtgetagagt actactgttg ttgaggatac "tattttat 
aattaagggc tgggaaggga gttagtatat tectaatcet aactatgtge atetttaatt 892 
tatgaaatca ctttgtttta acctttgatg aaaaaaaaaa aaaaaaaaaa 



772 

832 
892 
942 
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<210> 10 

<211> 240 
<212> PRT 
<213> Glycine max 

<400> 10 

Thr Ser Ser Leu lie lie Thr His Thr Pro Thr Leu lie Ala Met Glu 

1 5 1° 

Thr Gly Gly Phe His Gly Tyr Arg Lys Leu Pro Asn Thr Thr Ala Gly 

20 25 3° 

Leu Lys Leu Ser Val Ser Asp Met Asn Met Arg Gin Gin Val Ala Ser 

35 40 45 

Ser Asp His Ser Ala Ala Thr Gly Glu Glu Asn Glu Cys Thr Val Arg 



Glu Gin Asp Arg Phe Met Pro lie Ala Asn Val He Arg He Met Arg 
65 70 75 

Lvs He Leu Pro Pro His Ala Lys He Ser Asp Asp Ala Lys Glu Thr 

85 90 95 , 

He Gin Glu Cys Val Ser Glu Tyr He Ser Phe He Thr Gly Glu Ala 

100 105 110 

Asn Glu Arg Cys Gin Arg Glu Gin Arg Lys Thr He Thr Ala Glu Asp 

115 120 125 

Val Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr He Glu Pro 

130 135 140 

Leu Thr Met Tyr Leu His Arg Tyr Arg Glu Leu Glu Gly Asp Arg Thr 

145 150 I 55 , 17 

Ser Met Arg Gly Glu Pro Leu Gly Lys Arg Thr Val Glu Tyr Ala Thr 

165 170 175 

Leu Gly Val Ala Thr Ala Phe Val Pro Pro Pro Tyr His His His Asn 

180 185 I 90 

Glv Tyr Phe Gly Ala Ala Met Pro Met Gly Thr Tyr Val Arg Glu Ala 

195 200 205 

Pro Pro Asn Thr Ala Ser Ser His His His His His His His His His 

210 215 220 

His Ala Arg Gly He Ser Asn Ala His Glu Pro Asn Ala Arg Ser lie 
225 230 235 

<210> 11 
<211> 905 
<212> DNA 

<213> Veronia mespilifolia 

<220> 

<221> CDS 

<222> (58) . . . (699) 



gcacgagccfatttctagag agagaacgag agagaattct ctaaagagga aaaatag atg 60 

1 



aaa cat qqa gga ggt ttc cat ggc tac cac agg etc ccc ate cac cct 
llu Arg G?y Sy Sy Phe His Gly Tyr His Arg Leu Pro lie His Pro 
5 10 15 

aca tct gga ate caa caa teg gat atg aag eta aag eta cca gaa atg 
Thr Ser Gly He Gin Gin Ser Asp Met Lys Leu Lys Leu Pro Glu Met 



108 



156 



20 25 
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acc aac aat aac teg tec act gat gac aat gag tgc acc gtt cga gaa 
Thr Asn Asn Asn Ser Ser Thr Asp Asp Asn Glu Cys Thr Val Arg Glu 



35 



40 45 



caq gac cgc ttc atg ccg ata gca aac gtg ate cgc ate atg egg aag 
Gin Asp Arg Phe Met Pro lie Ala Asn Val He Arg He Met Arg Lys 



50 " 



ate ctt cct cca cat gee aag ate tct gat gat gee aaa gag acg ate 
He Leu Pro Pro His Ala Lys He Ser Asp Asp Ala Lys Glu Thr He 



70 



75 80 



caa gaa tgt gtt tea gag tac att age ttt gtc aca ggc gag gca aat 
Gin Glu Cys Val Ser Glu Tyr He Ser Phe Val Thr Gly Glu Ala Asn 



85 



90 95 



gac cgc tgc cag cgt gag caa agg aag acc ate aca get gaa gat gtg 

Asp Arg Cys Gin Arg Glu Gin Arg Lys Thr He Thr Ala Glu Asp Val 

100 105 HO 

etc tgg get atg age aaa ctg gga ttt gat gat tat ate gag ccc ttg 

Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr He Glu Pro Leu 



115 



120 125 



act gtg tat etc cat cgc tac agg gag ttt gat ggt ggc gaa cgt gga 
Thr Val Tyr Leu His Arg Tyr Arg Glu Phe Asp Gly Gly Glu Arg Gly 
130 135 140 145 

tec ata agg ggt gag ccc ctt gtg aag agg agt act tct gat cct ggt 
Ser lie Arg Gly Glu Pro Leu Val Lys Arg Ser Thr Ser Asp Pro Gly 
150 155 I 60 

cac ttt ggg atg get tct ttt gtg cct get ttt cat atg ggt cat cat 
His Phe Gly Met Ala Ser Phe Val Pro Ala Phe His Met Gly His His 

170 I 75 



165 



aac ggc ttc ttt ggt cct gca age att ggt ggt ttc ctg aaa gac cca 
Asn Gly Phe Phe Gly Pro Ala Ser He Gly Gly Phe Leu Lys Asp Pro 
180 185 190 

teg agt get ggc cct teg gga cct gca gtc get ggg ttt gag ccg tat 
III sir Ala Gly Pro Ser Gly Pro Ala Val Ala Gly Phe Glu Pro Tyr 
195 200 205 

get cag tgt aaa gag taactgcaaa aagtaggggt tgggatgaga tgatgatgat 

Ala Gin Cys Lys Glu 

210 

ggtggtggtg gtggtggttt gttttgtttt gttetttett ttttttttct tctttctttt 
cttggtcatt gaggaacaaa ettacattgg ttcaetttgg ctaggcatgt aaacggttaa 
catgettate aagtagtagt tttcgatcaa aaaaaaaaaa aaaaaa 

<210> 12 
<211> 214 
<212> PRT 

<213> Veronia mespilifolia 

Met Glu^Arg Gly Gly Gly Phe His Gly Tyr His Arg Leu Pro He His 
I 5 10 15 



204 



252 



300 



348 



396 



444 



492 



540 



588 



636 



684 



739 



799 
859 
905 
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Pro 


Thr 


Ser 


Gly 


Tl- 


vj±n 


VjJ.I1 


Ser 


Asp 


Met 


Lys 


Leu 


Lvs 


Leu 


Pro 


Glu 






20 










25 










30 


Val 


Arg 


Met 


Thr 


Asn 


Asn 


Asn 


Ser 


C A V- 


Thr 


Asp Asp 


As n 


Glu 


Cvs 


Thr 




35 










40 










45 


He 


Met 


Arg 


Glu 


Gin 


Asp Arg 


Phe 


Met 


Pro 


Tie 

lie 


Ala 


Asn 


Val 


He 


Arg 




50 










55 










60 






Glu 


Thr 
80 


Lys 
65 


He 


Leu 


Pro 


Pro 


tils 
/ u 


Ala 


Lys 


He 


Ser 


Asp 
75 


Asp 


Ala 


Lvs 


He 


Gin 


Glu 


Cys 


Val 


Ser 


Glu 


Tyr 


He 


Ser 


cIJc 


Val 


Thr 


Glv 


Glu 


Ala 






oo 










90 










95 




Asn 


Asp 


Arg 


Cys 
100 


Gin 


Arg 


Glu 


Gin 


Arg 
105 


Lys 




He 


Thr 


Ala 
110 


Glu 


Asp 


Val 


Leu 


Trp 


Ala 


Met 


Ser 


Lys 


Leu 


Gly 


Phe 


Asp 


Asp 


Tvr 


He 


Glu 


Pro 






115 










120 










125 




Glu 




Leu 


Thr 


Val 


Tyr 


Leu 


His 


Arg 


Tyr 


Arg 


Glu 


Phe 


Asp 


Gly 


Gly 


Arg 




130 








135 










140 








Pro 
160 


Gly 
145 


Ser 


He 


Arg 


Gly 


Glu 
150 


Pro 


Leu 


Val 


Lys 


Arg 
155 


Ser 


Thr 


Ser 


Asp 


Gly 


hi S 


Phe 


Gly Met 
165 


Ala 


Ser 


Phe 


Val 


Pro 
170 


Ala 


Phe His Met Gly His 
175 


HIS 


Asn 


Gly 


Phe 


Phe 


Glv 


Pro 


Ala 


Ser 


He 


Gly 


Gly 


Phe 


Leu 


Lys 


Asp 






180 










185 










190 






Pro 


Ser 


Ser 
195 


Ala 


Gly 


Pro 


Ser 


Gly 
200 


Pro 


Ala 


Val 


Ala 


Gly 

205 


Phe 


Glu 


Pro 


Tyr 


Ala 
210 


Gin 


Cys 


Lys 


Glu 























<210> 13 

<211> 763 

<212> DNA 

<213> Zea mays 

<220> 

<221> CDS 

<222> (1) . . . (480) 

<400> 13 

gca cga ggc aag acc gtc acc tec gag gac ate gtg tgg gec atg age 

Ala Arg Gly Lys Thr Val Thr Ser Glu Asp lie Val Trp Ala Met Ser 
1 5 10 15 

cgc etc ggc ttc gac gac tac gtc gcg ccc etc ggc gee ttc etc cag 
Arg Leu Gly Phe Asp Asp Tyr Val Ala Pro Leu Gly Ala Phe Leu Gin 
20 25 30 

cgc atg cgc gac gac age gac cac ggc ggt gaa gag cgc ggc ggc cct 
Arg Met Arg Asp Asp Ser Asp His Gly Gly Glu Glu Arg Gly Gly Pro 
35 40 45 

gca ggg cgt ggt ggc teg cgc cgc ggc teg teg tec ttg ccg etc cac 
Ala Gly Arg Gly Gly Ser Arg Arg Gly Ser Ser Ser Leu Pro Leu Has 
50 55 6° 

tqc ccg cag cag atg cac cac ctg cac cca gec gtc tgc egg cgt ccg 
Cys Pro Gin Gin Met His His Leu His Pro Ala Val Cys Arg Arg Pro 
3 75 80 



65 70 



cac cag age gtg teg cct get gca gga tac gec gtc egg ccc gtt ccc 
His Gin Ser Val Ser Pro Ala Ala Gly Tyr Ala Val Arg Pro Val Pro 
85 90 95 



48 



96 



144 



192 



240 



288 
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cqc ccg atg cca gcc cgt ggg tac cgc atg cag ggc gga gac cac cgc 
Arq Pro Met Pro Ala Arg Gly Tyr Arg Met Gin Gly Gly Asp His Arg 
100 105 HO 

age gtg ggc ggc gtg get ccc tgc age tac gga ggg gcg etc gtc cag 
Ser Val Gly Gly Val Ala Pro Cys Ser Tyr Gly Gly Ala Leu Val Gin 
115 120 125 

gcc ggt gga acc caa cac gtt gtt gga ttc cac gac gac gag gca age 
Ala Gly Gly Thr Gin His Val Val Gly Phe His Asp Asp Glu Ala Ser 
130 135 140 

tct teg agt gaa aat ccg ccg ccg gag ggg cgt gcc get ggc teg aac 
Ser Ser Ser Glu Asn Pro Pro Pro Glu Gly Arg Ala Ala Gly Ser Asn 
145 150 155 160 

tagectaget tctcagttcc ccgtgtacaa taagaggggc ggtcgcggcg ccgcgccgcg 
cccttgggtt gggccgggcg etatgetgea gtttggtttg taaactaacg agectagggt 
agctggtgca cgcgcgccac ctcgccggac gtcgccgtcg tegteggcat ggacttaacc 
ggcgggccct gttgttattt ctcaagtttg tagccaacgc actgttcggt gcgttccata 
atttaattta ccatgttgct ctcgaaaaaa aaaaaaaaaa aaa 



336 



384 



432 



480 



540 
600 
660 
720 
763 



<210> 14 
<211> 160 
<212> PRT 
<213> Zea mays 



<400> 
Ala Arg Gly 
1 

Arg Leu Gly 

Arg Met Arg 
35 

Ala Gly Arg 
50 

Cys Pro Gin 
65 

His Gin Ser 

Arg Pro Met 

Ser Val Gly 
115 

Ala Gly Gly 
130 

Ser Ser Ser 
145 



14 

Lys Thr Val 

5 

Phe Asp Asp 
20 

Asp Asp Ser 

Gly Gly Ser 

Gin Met His 
70 

Val Ser Pro 
85 

Pro Ala Arg 
100 

Gly Val Ala 

Thr Gin His 

Glu Asn Pro 
150 



Thr Ser 

Tyr Val 

Asp His 

40 
Arg Arg 
55 

His Leu 

Ala Ala 

Gly Tyr 

Pro Cys 
120 
Val Val 
135 

Pro Pro 



Glu Asp 
10 

Ala Pro 
25 

Gly Gly 

Gly Ser 

His Pro 

Gly Tyr 

90 
Arg Met 
105 

Ser Tyr 
Gly Phe 
Glu Gly 



lie Val Trp Ala 
Leu Gly 
Glu Glu 



Ser Ser 
60 

Ala Val 
75 

Ala Val 

Gin Gly 

Gly Gly 

His Asp 
140 
Arg Ala 
155 



Ala Phe 
30 

Arg Gly 
45 

Leu Pro 



Cys Arg 

Arg Pro 

Gly Asp 
110 
Ala Leu 
125 

Asp Glu 
Ala Gly 



Met Ser 
15 

Leu Gin 

Gly Pro 

Leu His 

Arg Pro 
80 

Val Pro 
95 

His Arg 

Val Gin 

Ala Ser 

Ser Asn 
160 



<210> 15 
<211> 622 
<212> DNA 
<213> Zea mays 



<220> 

<221> CDS 

<222> (3) • . . (622) 



<221> miscjEeature 
<222> (1) . . . (622) 
<223> n ~ A,T,C or G 
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<400> 15 . . 7 

gc atg aat aat ccc caa aac cct aaa gcc agt get cct tgc acc ttg 
Met Asn Asn Pro Gin Asn Pro Lys Ala Ser Ala Pro Cys Thr Leu 



1 



5 10 



cca ccg gag ctt ccc aaa gaa gca gtg gcg acc gac gaa gca ccg ccg 
Pro Pro Glu Leu Pro Lys Glu Ala Val Ala Thr Asp Glu Ala Pro Pro 



25 30 

cca atg ggc aac aac aac aac acg gaa teg gcg acg gcg acg atg gtc 
Pro Met Gly Asn Asn Asn Asn Thr Glu Ser Ala Thr Ala Thr Met Val 



35 



40 45 



egg gag cag gac egg ctg atg ccc gtg gcc aac gtg tec cgc ate atg 
Arq Glu Gin Asp Arg Leu Met Pro Val Ala Asn Val Ser Arg He Met 
50 55 60 

cgc eaa gtg ctg cct ccg tae gee aag ate tec gac gac gee can gaa 
Arq Gin Val Leu Pro Pro Tyr Ala Lys He Ser Asp Asp Ala Xaa Glu 
65 70 75 

gtn ate caa gaa ttg ctn ttc gga att tea tea ctt ncg tec tgg cga 
Xaa He Gin Glu Leu Xaa Phe Gly lie Ser Ser Leu Xaa Ser Trp Arg 
80 85 90 95 

ggc gaa acg aag egg tgc cac acc gag cgc cgc aag acc gtc acc tec 
Gly Glu Thr Lys Arg Cys His Thr Glu Arg Arg Lys Thr Val Thr Ser 
100 105 H° 



145 



teg tct net tgg cgc tec ctt gcc gca ana gat gac aac ttg cac caa 
Ser Ser Xaa Trp Arg Ser Leu Ala Ala Xaa Asp Asp Asn Leu His Gin 

170 1'5 



160 I 65 



acg tct gee ggg ntc gga cca aaa ctn ttc cct gtt gca gga ata ccc 
Thr Ser Ala Gly Xaa Gly Pro Lys Xaa Phe Pro Val Ala Gly lie Pro 

185 190 



180 



ate cng ggc ent tec ccc ccn aat cea acc att tgg ttt ccc ctt gc 
Val Xaa Gly Xaa Ser Pro Xaa Asn Pro Thr lie Trp Phe Pro Leu 
195 200 205 



<210> 16 
<211> 206 
<212> PRT 
<213> Zea mays 

<220> 



95 



143 



191 



239 



287 



335 



383 



431 



gaa gac ate gtg tgg gcc atg age cgc etc ggc ttc gac gac tac gtc 
Glu Asp lie Val Trp Ala Met Ser Arg Leu Gly Phe Asp Asp Tyr Val 
115 120 125 

gcg ccc etc ggc gcc ttc etc cag cgc atg cgc gac nac age gaa cac 
Ala Pro Leu Gly Ala Phe Leu Gin Arg Met Arg Asp Xaa Ser Glu His 
130 135 I 40 

ggg ggt gaa aac gcg gcg gcc tgc ang ggg tng tgg ten cgc cgc ggg 479 
Gly Gly Glu Asn Ala Ala Ala Cys Xaa Gly Xaa Trp Xaa Arg Arg Gly 

150 155 



527 



575 



622 
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<221> VARIANT 

<222> (1) - . - (206) 

<223> Xaa = Any Amino Acid 



<400> 16 , 
Met Asn Asn Pro Gin Asn Pro Lys Ala Ser Ala Pro Cys Thr Leu Pro 

1 5 10 15 

Pro Glu Leu Pro Lys Glu Ala Val Ala Thr Asp Glu Ala Pro Pro Pro 

20 25 30 

Met Gly Asn Asn Asn Asn Thr Glu Ser Ala Thr Ala Thr Met Val Arg 

35 40 45 

Glu Gin Asp Arg Leu Met Pro Val Ala Asn Val Ser Arg lie Met Arg 

50 55 60 

Gin Val Leu Pro Pro Tyr Ala Lys He Ser Asp Asp Ala Xaa Glu Xaa 
65 70 75 80 

He Gin Glu Leu Xaa Phe Gly lie Ser Ser Leu Xaa Ser Trp Arg Gly 

85 90 95 

Glu Thr Lys Arg Cys His Thr Glu Arg Arg Lys Thr Val Thr Ser Glu 

100 105 HO 

Asp He Val Trp Ala Met Ser Arg Leu Gly Phe Asp Asp Tyr Val Ala 

115 120 125 

Pro Leu Gly Ala Phe Leu Gin Arg Met Arg Asp Xaa Ser Glu His Gly 

130 135 140 

Gly Glu Asn Ala Ala Ala Cys Xaa Gly Xaa Trp Xaa Arg Arg Gly Ser 
145 150 155 160 

Ser Xaa Trp Arg Ser Leu Ala Ala Xaa Asp Asp Asn Leu His Gin Thr 

165 170 175 

Ser Ala Gly Xaa Gly Pro Lys Xaa Phe Pro Val Ala Gly He Pro Val 

180 185 190 

Xaa Glv Xaa Ser Pro Xaa Asn Pro Thr He Trp Phe Pro Leu 
195 200 205 



<210> 17 

<211> 1121 

<212> DNA 

<213> Glycine max 



<220> 

<221> CDS 

<222> (3) . . . (1121) 



<400> 17 - 
gc acg agg gaa act gga ggc ttt cat ggc tac cgc aag etc ccc aac 
Thr Arg Glu Thr Gly Gly Phe His Gly Tyr Arg Lys Leu Pro Asn 

10 15 



1 5 



aca acc tct ggg ttg aag ctg tea gtg tea gac atg aac atg aac atg 
Thr Thr Ser Gly Leu Lys Leu Ser Val Ser Asp Met Asn Met Asn Met 

25 30 



20 



agg cag cag cag gta gca tea tea gat cag aac tgc age aac cac agt 
Arg Gin Gin Gin Val Ala Ser Ser Asp Gin Asn Cys Ser Asn His Ser 

40 45 



35 



gca gca gga gag gag aac gaa tgc acg gtg agg gag caa gac agg ttc 
Ala Ala Gly Glu Glu Asn Glu Cys Thr Val Arg Glu Gin Asp Arg Phe 



50 



55 60 



cca 



atg cca ate get aac gtg ata egg ate atg cgc aag att etc cct 
Met Pro He Ala Asn Val He Arg He Met Arg Lys He Leu Pro Pro 
65 70 75 



95 



143 



191 



239 
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cac gca aaa ate tec gat gat gca aag gag aca ate caa gag tgc gtg 
lis Ma Lys lie Ser Asp Asp Ala Lys Glu Thr He Gin Glu Cys Val 

85 90 



80 



tea aaq tac ate age ttc ate ace ggg gag gec aac gag cgt tgc cag 
Ser Glu Tyr lie Ser Phe lie Thr Gly Glu Ala Asn Glu Arg Cys Gin 

105 110 



100 



agg gag cag cgc aag ace ata acc gea gag gac gtg ctt tgg gca atg 
Arg 111 Gin Arg Lys Thr lie Thr Ala Glu Asp Val Leu Trp Ala Met 



115 



aat aaq ctt gga ttc gac gac tac ate gaa ccg tta acc atg tac ctt 
Ser Lys Leu Sy Phe Asp Asp Tyr He Glu Pro Leu Thr Met Tyr Leu 

135 140 



130 



cac cgc tac cgt gag ctg gag ggt gac cgc acc tct atg agg ggt gaa 
Sis Arg Tyr Arg Glu Leu Glu Gly Asp Arg Thr Ser Met Arg Gly Glu 
145 150 155 

ccg etc ggg aag agg act gtg gaa tat gec acg ctt get act get ttt 
Pro Leu Gly Lys Arg Thr Val Glu Tyr Ala Thr Leu Ala Thr Ala Phe 
160 1" I 70 1/b 

atg ccg cca ccc ttt cat cac cac aat ggc tac ttt ggt get gec atg 
Val Pro Pro Pro Phe His His His Asn Gly Tyr Phe Gly Ala Ala Met 
180 I 85 190 

ccc atg ggg act tac gtt agg gaa acg cca cca aat get gcg tea tct 
Pro Me? Gly Thr Tyr Val Arg Glu Thr Pro Pro Asn Ala Ala Ser Ser 
195 200 205 

cat cac cat cat gga ate tec aat get cat gaa cca aat get cgc tec 
Ts Tis His His lly lie Ser Asn Ala His Glu Pro Asn Ala Arg Ser 
210 215 220 

ata taa aat taa tga aga gta ctg ttc agt agg aga aca aga ctt ctt 
ne * Asn * * Arg Val Leu Phe Ser Arg Arg Thr Arg Leu Leu 



225 



230 235 



qga ctt gat tag ctt aac tct cag tga ttg gtg tta gag tac tgt tgt 
Gly Leu Asp * Leu Asn Ser Gin * Leu Va] L Leu Glu Tyr Cys Cy. 



240 



taa aaa tgg tta att tta taa tta agg get ggg aat tgg gga gtt agt 
2 G?y JS Leu He Leu * Leu Arg Ala Gly Asn Trp Gly Val Ser 
255 260 



ata tat tec taa tec taa tta tgt gea tet tta att tat gga ata act 
He Tyr Ser * Ser * Leu Cys Ala Ser Leu lie Tyr Gly lie Thr 



265 



270 

tta ttt ttt gtt tta act tct gat aat ttg gat ttt ctg atg ttt aat 
Tel Z Phe val Leu Thr Ser Asp Asn Leu Asp Phe Leu Met Phe Asn 
280 285 290 

gtg gtt ttg tet ate cet tat taa cag tgc caa get taa ggt ttt age 
Val Val Leu Ser lie Pro Tyr * Gin Cys Gin Ala Gly Phe Ser 



287 



335 



383 



431 



479 



527 



575 



623 



671 



719 



767 



815 



863 



911 



959 



295 



300 305 
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cat get cca aaa tgg aat act tgt act gtt atg ttg ttc tgg tag tga 1007 
His Ala Pro Lys Trp Asn Thr Cys Thr Val Met Leu Phe Trp * 
310 315 320 

tqq tga tga aac ctg caa gtt atg ttt atg tat aaa gec act att gat 
Trp * * Asn Leu Gin Val Met Phe Met Tyr Lys Ala Thr lie Asp 



325 



330 335 



caa aat tag aga aat tat cat tta ata agt ate etc cca tgt taa ttt 
Gin Asn * Arg Asn Tyr His Leu lie Ser He Leu Pro Cys * Phe 
340 345 350 

taa aaa aaa aaa aaa aaa 
+ Lys Lys Lys Lys Lys 
355 



1055 



1103 



1121 



<210> 18 

<211> 355 

<212> PRT 

<213> Glycine max 

<400> 18 

Thr Arq Glu Thr Gly Gly Phe His Gly Tyr Arg Lys Leu Pro Asn Thr 

1 5 10 I 5 

Thr Ser Gly Leu Lys Leu Ser Val Ser Asp Met Asn Met Asn Met Arg 

20 25 30 

Gin Gin Gin Val Ala Ser Ser Asp Gin Asn Cys Ser Asn His Ser Ala 

35 40 45 

Ala Gly Glu Glu Asn Glu Cys Thr Val Arg Glu Gin Asp Arg Phe Met 

50 55 60 

Pro He Ala Asn Val He Arg He Met Arg Lys He Leu Pro Pro His 
65 70 75 80 

Ala Lys lie Ser Asp Asp Ala Lys Glu Thr He Gin Glu Cys Val Ser 

85 90 95 

Glu Tyr He Ser Phe He Thr Gly Glu Ala Asn Glu Arg Cys Gin Arg 

100 105 HO 

Glu Gin Arg Lys Thr He Thr Ala Glu Asp Val Leu Trp Ala Met Ser 

115 120 125 

Lys Leu Gly Phe Asp Asp Tyr He Glu Pro Leu Thr Met Tyr Leu His 

130 135 140 

Arg Tyr Arg Glu Leu Glu Gly Asp Arg Thr Ser Met Arg Gly Glu Pro 
145 150 155 160 

Leu Gly Lys Arg Thr Val Glu Tyr Ala Thr Leu Ala Thr Ala Phe Val 

165 170 175 

Pro Pro Pro Phe His His His Asn Gly Tyr Phe Gly Ala Ala Met Pro 

180 185 190 

Met Gly Thr Tyr Val Arg Glu Thr Pro Pro Asn Ala Ala Ser Ser His 

!95 200 205 

His His His Gly He Ser Asn Ala His Glu Pro Asn Ala Arg Ser He 

2io 215 220 

Asn Arg Val Leu Phe Ser Arg Arg Thr Arg Leu Leu Gly Leu Asp Leu 
225 230 235 240 

Asn Ser Gin Leu Val Leu Glu Tyr Cys Cys Gly Trp Leu lie Leu Leu 

245 250 255 

Arg Ala Gly Asn Trp Gly Val Ser He Tyr Ser Ser Leu Cys Ala Ser 

260 265 270 

Leu lie Tyr Gly He Thr Leu Phe Phe Val Leu Thr Ser Asp Asn Leu 

275 280 285 

Asp Phe Leu Met Phe Asn Val Val Leu Ser lie Pro Tyr Gin Cys Gin 
290 295 300 
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Ala Gly Phe Ser His Ala Pro Lys Trp Asn Thr Cys Thr Val Met Leu 
305 310 315 320 

Phe Trp Trp Asn Leu Gin Val Met Phe Met Tyr Lys Ala Thr He Asp 

325 330 335 

Gin Asn Arg Asn Tyr His Leu He Ser He Leu Pro Cys Phe Lys Lys 
340 345 350 



Lys Lys Lys 
355 

<210> 19 

<211> 796 

<212> DNA 

<213> Glycine max 

<220> 

<221> CDS 

<222> {!)..- (513) 

<400> 19 

gca cga gca atg gcg gga gtg agg gaa cag gac cag tac atg ccg ata 

Ala Arg Ala Met Ala Gly Val Arg Glu Gin Asp Gin Tyr Met Pro He 

1 5 10 15 

gcg aac gtg ata agg ate atg cgt egg att ctg cca gcg cac gcg aag 
Ala Asn Val He Arg He Met Arg Arg He Leu Pro Ala His Ala Lys 
20 25 30 

ate tea gac gac gcg aag gag acg ate cag gag tgc gtg tct gag tac 
He Ser Asp Asp Ala Lys Glu Thr He Gin Glu Cys Val Ser Glu Tyr 
35 40 45 

ate agt ttc ate acg gcg gag gcg aac gag egg tgc cag egg gag cag 
He Ser Phe He Thr Ala Glu Ala Asn Glu Arg Cys Gin Arg Glu Gin 
50 55 60 

egg aag acg gtg acc gca gag gat gtg ttg tgg gcg atg gag aag ctt 
Arq Lys Thr Val Thr Ala Glu Asp Val Leu Trp Ala Met Glu Lys Leu 
65 70 75 80 

ggc ttt gac aac tac get cac cct etc tct ctt tac ctt cac cgc tac 
Gly Phe Asp Asn Tyr Ala His Pro Leu Ser Leu Tyr Leu His Arg Tyr 



85 90 95 

cgc gag agt gaa gga gaa cct get tct gtc aga cgc get tct tct gca 
Arg Glu Ser Glu Gly Glu Pro Ala Ser Val Arg Arg Ala Ser Ser Ala 
100 105 HO 



gac gat cat aac get get tct gga tct ggt ggt ttt gtt gcg cct ttt 
Asp Asp His Asn Ala Ala Ser Gly Ser Gly Gly Phe Val Ala Pro Phe 
145 150 155 160 



48 



96 



144 



192 



240 



288 



336 



384 



atg ggg ate aat aat aat atg gtg cac cca cct tat att aat tct cat 
Met Gly He Asn Asn Asn Met Val His Pro Pro Tyr He Asn Ser His 
115 120 125 

ggc ttt gga atg ttt gat ttt gac cca tea teg caa ggg ttt tac agg 432 
Gly Phe Gly Met Phe Asp Phe Asp Pro Ser Ser Gin Gly Phe Tyr Arg 
130 135 140 



480 
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gat cct tat get aac ate aaa cgt gat gec ctg tgatcatgta agaacaacaa 
Asp Pro Tyr Ala Asn He Lys Arg Asp Ala Leu 
165 I 70 

ctaqtgcatg ctgettttte acttggttag ttatattcaa gcacaagcac atgcaggtgc 

aSScaac? at?tagcttc atctacaaat cttttttcct ctcttcttct catgetttaa 

Sum acaatacttg ttattcattg ttatgctcaa ttgetagett ctattcatcg 

tcgactgLt gtattgttga tgttcattac agtaacagat aagatggtaa ctgetttact 

acttcaaaaa aaaaaaaaaa aaa 



533 



593 
653 
713 
773 
796 



<210> 20 

<211> 171 

<212> PRT 

<213> Glycine max 



Ala Ar^Jlla Met Ala Gly Val Arg Glu Gin Asp Gin Tyr Met Pro He 

1 5 10 15 

Ala Asn Val lie Arg lie Met Arg Arg He Leu Pro Ala His Ala Lys 

20 25 30 

He Ser Asp Asp Ala Lys Glu Thr lie Gin Glu Cys Val Ser Glu Tyr 

35 40 45 

lie Ser Phe He Thr Ala Glu Ala Asn Glu Arg Cys Gin Arg Glu Gin 

50 55 60 

Arg Lys Thr Val Thr Ala Glu Asp Val Leu Trp Ala Met Glu Lys Leu 
65 70 ^5 

Gly Phe Asp Asn Tyr Ala His Pro Leu Ser Leu Tyr Leu His Arg Tyr 

85 9° 95 . 

Arg Glu Ser Glu Gly Glu Pro Ala Ser Val Arg Arg Ala Ser Ser Ala 

100 105 110 

Met Gly He Asn Asn Asn Met Val His Pro Pro Tyr lie Asn Ser His 

115 120 I 25 

Gly Phe Gly Met Phe Asp Phe Asp Pro Ser Ser Gin Gly Phe Tyr Arg 

130 135 I 40 

Asp Asp His Asn Ala Ala Ser Gly Ser Gly Gly Phe Val Ala Pro Phe 
145 150 !55 

Asp Pro Tyr Ala Asn He Lys Arg Asp Ala Leu 
165 170 



<210> 21 
<211> 1098 
<212> DNA 

<213> Triticum aestivum 



<220> 

<221> CDS 

<222> (55) . . - (894) 



<400> 21 _ s7 

gcacgagcaa gtgcgagtgc gactacctgc attgeacett ggctagccct agac atg 



Met 
1 



aaa aac qac ggc gtc ccc aac gga cca gcg gcg ccg gca cct acc cag 
Til Asn Lp G?y Sal Pro Asn Gly Pro Ala Ala Pro Ala Pro Thr Gin 
5 10 15 



105 



153 
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201 

y wv- —v--, 

Ala Lys He Ser 

40 45 



gtg ate cgc ate atg cgc cgt gcg etc cct gec cac gec aag ate tec 
Val He Arg He Met Arg Arg Ala Leu Pro Ala His Ala Lys He Ser 
35 



qac gac gee aag gag gcg att cag gaa tgc gtg tec gag ttc ate age 
Asp Asp Ala Lys Glu Ala He Gin Glu Cys Val Ser Glu Phe lie Ser 
50 55 60 65 

ttc gtc acc ggc gag gec aac gaa egg tgc cgc atg cag cac cgc aag 
Phe Val Thr Gly Glu Ala Asn Glu Arg Cys Arg Met Gin His Arg Lys 
70 75 80 

acc gtc aac gec gaa gac ate gtg tgg gec eta aac cgc etc ggc ttc 
Thr Val Asn Ala Glu Asp He Val Trp Ala Leu Asn Arg Leu Gly Phe 
85 90 95 

cac gac tac gtc gtg ccc etc age gtc ttc ctg cac cgc atg cgc gac 
Asp Asp Tyr Val Val Pro Leu Ser Val Phe Leu His Arg Met Arg Asp 
P 100 105 HO 

ccc gag gcg ggg aca ggt ggt gec get gca ggc gac age cgc gee gtg 
Pro Glu Ala Gly Thr Gly Gly Ala Ala Ala Gly Asp Ser Arg Ala Val 
115 120 125 

acq agt gcg cct ccc cgc gcg gec ccg ccc gtg ate cac gee gtg ccg 
Thr Ser Ala Pro Pro Arg Ala Ala Pro Pro Val He His Ala Val Pro 
130 135 140 145 

ctg cag get cag cgc ccg atg tac gcg ccc ccg get ccg ttg cag gtt 
Leu Gin Ala Gin Arg Pro Met Tyr Ala Pro Pro Ala Pro Leu Gin Val 
150 155 I 60 

gag aat cag atg cag egg cct gtg tac get ccc ccg get ccg gtg cag 
Glu Asn Gin Met Gin Arg Pro Val Tyr Ala Pro Pro Ala Pro Val Gin 



165 



170 175 



gtt cag atg cag egg ggc ate tat ggg ccc egg get cca gtg cac ggg 
Val Gin Met Gin Arg Gly He Tyr Gly Pro Arg Ala Pro Val His Gly 
180 185 190 

tac gec gtc gga atg gcg ccc gtg egg gec aac gtc ggc ggg cag tac 
Tyr Ala Val Gly Met Ala Pro Val Arg Ala Asn Val Gly Gly Gin Tyr 
195 200 205 

cag gtg ttc ggc gga gag ggt gtc atg gee cag caa tac tac ggg tac 
Gin Val Phe Gly Gly Glu Gly Val Met Ala Gin Gin Tyr Tyr Gly Tyr 
210 215 220 225 

ggg tac gag gaa gga gcg tac ggc gca ggt age age aac gga gga gee 
Gly Tyr Glu Glu Gly Ala Tyr Gly Ala Gly Ser Ser Asn Gly Gly Ala 
230 235 240 

gec att ggc gac gag gag age teg tec aac ggc gtg ccg gca ccg ggg 
Ala He Gly Asp Glu Glu Ser Ser Ser Asn Gly Val Pro Ala Pro Gly 
245 250 255 

gag ggc atg ggg gag cca gag cca gag cca gca gca gaa gaa teg cat 
Glu Gly Met Gly Glu Pro Glu Pro Glu Pro Ala Ala Glu Glu Ser His 
260 265 270 



249 



297 



345 



393 



441 



489 



537 



585 



633 



681 



729 



777 



825 



873 
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gac aag ccc gtc caa tct ggc tagtcgcgtg cgcggcgcgc gttagcttct 
Asp Lys Pro Val Gin Ser Gly 
275 280 

gcgtcctgtg tactgtaata atttgccgtg tcgatccggc catggtttgt gtgtgcgtag 
tgcttatcta atgtgggctt gtcctctagt aattcatgta ttgcttatct aatgtggact 
tgtcctctag taattcatgt actctttgct gttgaaaaaa aaaaaaaaaa aaaa 

<210> 22 
<211> 280 
<212> PRT 

<213> Triticum aestivum 
<400> 22 

Met Glu Asn Asp Gly Val Pro Asn Gly Pro Ala Ala Pro Ala Pro Thr 

1 5 10 15 

Gin Gly Thr Pro Val Val Arg Glu Gin Asp Arg Leu Met Pro He Ala 

20 25 30 

Asn Val He Arg He Met Arg Arg Ala Leu Pro Ala His Ala Lys He 

35 40 45 

Ser Asp Asp Ala Lys Glu Ala He Gin Glu Cys Val Ser Glu Phe He 

50 55 60 

Ser Phe Val Thr Gly Glu Ala Asn Glu Arg Cys Arg Met Gin His Arg 
65 70 75 80 

Lys Thr Val Asn Ala Glu Asp He Val Trp Ala Leu Asn Arg Leu Gly 

85 90 95 

Phe Asp Asp Tyr Val Val Pro Leu Ser Val Phe Leu His Arg Met Arg 

100 105 HO 

Asp Pro Glu Ala Gly Thr Gly Gly Ala Ala Ala Gly Asp Ser Arg Ala 

115 120 125 

Val Thr Ser Ala Pro Pro Arg Ala Ala Pro Pro Val He His Ala Val 

130 135 140 

Pro Leu Gin Ala Gin Arg Pro Met Tyr Ala Pro Pro Ala Pro Leu Gin 
145 150 155 160 

Val Glu Asn Gin Met Gin Arg Pro Val Tyr Ala Pro Pro Ala Pro Val 

165 170 175 

Gin Val Gin Met Gin Arg Gly He Tyr Gly Pro Arg Ala Pro Val His 

180 185 190 

Gly Tyr Ala Val Gly Met Ala Pro Val Arg Ala Asn Val Gly Gly Gin 

195 200 205 

Tyr Gin Val Phe Gly Gly Glu Gly Val Met Ala Gin Gin Tyr Tyr Gly 

210 215 220 

Tvr Glv Tyr Glu Glu Gly Ala Tyr Gly Ala Gly Ser Ser Asn Gly Gly 
225 230 235 240 

Ala Ala He Gly Asp Glu Glu Ser Ser Ser Asn Gly Val Pro Ala Pro 

245 250 255 

Glv Glu Gly Met Gly Glu Pro Glu Pro Glu Pro Ala Ala Glu Glu Ser 

260 265 270 

His Asp Lys Pro Val Gin Ser Gly 
275 280 

<210> 23 
<211> 65 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> LEC1 consensus protein sequence 

<221> VARIANT 
<222> {!)... (65) 



924 



984 
1044 

1098 
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<223> Xaa - Any Amino Acid 

Arg Glu' Gin Asp Xaa Xaa Met Pro lie Ala Asn Val lie Arg lie Met 

1 5 1° 15 

Arg Xaa Xaa Leu Pro Xaa His Ala Lys lie Ser Asp Asp Ala Lys Glu 

20 25 30 

Xaa He Gin Glu Cys Val Ser Glu Tyr lie Ser Phe Xaa Thr Xaa Glu 

35 40 45 

Ala Asn Xaa Arg Cys Xaa Xaa Xaa Xaa Arg Lys Thr Xaa Xaa Xaa Glu 
50 55 60 

Xaa 
65 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-62 all partially 

An isolated nucleic acid comprising a member of the group 
consisting of those as embodied in claim 1 of the 
application, especially having the sequence set forth in SEQ 
ID NO: 1,13,15, being a polynucleotide having a plant 
HAP3-type ccaat-box transcriptional activator derived from 
maize, fragments, functional homologs, hybridizing and 
complementary sequences thereof. 

Vectors, expression cassettes, host cells, and transgenic 
plants comprising said nucleic acid sequence. 
A protein encoded thereby, especially as set forth In SEQ ID 
N0:2,14,16 and bearing the conserved motifs of SEQ ID N0:23, 
the protein being a LEC1 transcriptional activator. 
Methods of using a nucleic acid sequence or a protein as 
defined above to modulate LEC1 acitivity in a plant, to 
enhance tissue culture response in a plant cell, or to 
induce somatic embryogenesis in a plant cell other than an 
Arabidopsis cell . 

Methods for positive selection of a transformed cell, for 
inducing apomixis, for increasing transformation efficiency, 
and for increasing recovery of regenerated plants comprising 
the use of an LEC1 polynucleotide as defined above. 



2. Claims: 1-62 all partially 

idem for SEQ ID N0:7,8, the sequences being derived from 
Argemone 



3. Claims: 1-62 all partially 

idem for SEQ ID N0:9, 10,17-20, the sequences being derived 
from soybean 



4. Claims: 1-62 all partially 

idem for SEQ ID NO: 11, 12, the sequences being derived from 
Veronia 



5. Claims: 1-62 all partially 

idem for SEQ ID N0:21,22, the sequences being derived from 
wheat 
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